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Abstract 



Independence - the study of what is relevant to a given problem of reasoning - has 
received an increasing attention from the AI community. In this paper, we consider two 
basic forms of independence, namely, a syntactic one and a semantic one. We show features 
and drawbacks of them. In particular, while the syntactic form of independence is compu- 
tationally easy to check, there are cases in which things that intuitively are not relevant 
are not recognized as such. We also consider the problem of forgetting, i.e., distilling from 
a knowledge base only the part that is relevant to the set of queries constructed from a 
subset of the alphabet. While such process is computationally hard, it allows for a simplifi- 
cation of subsequent reasoning, and can thus be viewed as a form of compilation: once the 
relevant part of a knowledge base has been extracted, all reasoning tasks to be performed 
can be simplified. 

1. Introduction 

We successively present some motivations, the scope of the paper, and the contribution and 
the organization of the paper. 

1.1 Motivations 

In many situations of everyday life, we are confronted with the problem of determining what 
is relevant. Especially, as a preliminary step to various intelligent tasks (e.g., planning, de- 
cision making, reasoning), it is natural and reasonable to discard everything but what is 
relevant to achieve them efficiently. For instance, before starting to write this paper, we had 
to consider the relevant literature on the topic, and only it; gathering on our desks all the 
papers about relevance that we have at hand led us to set away our favourite cook books, 
because they are of no help to the task of writing this paper. The ability to focus on what 
is relevant (or dually to discard what is not) can be considered as a central, characteristic 
feature of intelligence; it is believed that over 90% of the neuronal connections of our brains 
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are inhibitory and serve to give up sensorial inputs (Subramanian, Greiner, &; Pearl, 1997). 
This explains why irrelevance, under various names as independence, irredundancy, influ- 
enceability, novelty, separability, is nowadays considered as an important notion in many 
fields of artificial intelligence (see a survey for more details, e.g., Greiner &; Subramanian, 
1995; Subramanian et al., 1997). 

In the following, we are concerned with relevance for reasoning. In this framework, 
the task is typically that of determining whether some piece of knowledge (a query) ip 
can be derived from a knowledge base S. Several reasoning schemes can be taken into 
account here, from the classical one (inference is classical entailment) to more sophisticated 
ones (common-sense inference). When dealing with such reasoning tasks, relevance is often 
exploited so as to make inference more efficient from a computational point of view. In other 
reasoning problems, the purpose is to explicitly derive some intensionally-characterized 
pieces of knowledge (e.g., tell me all what you know about Tweety). For such problems 
(that are not reducible to decision problems), relevance also has a role to play, for instance 
by allowing us to characterize the pieces of information we are interested in by forgetting 
the other ones. 

To what extent is the goal of improving inference reachable? In order to address this 
point, a key issue is the computational complexity one. Indeed, assume that we know that 
the resolution of some reasoning problems can be sped up once relevant information has been 
elicited. In the situation where it is computationally harder to point out such information 
from the input than to reason directly from the input, computational benefits are hard to 
be expected. If so, alternative uses of relevance for reasoning are to be investigated. For 
instance, searching for relevance information can be limited by considering only pieces of 
knowledge that can be generated in a tractable way. If such information depend only on the 
knowledge base, another possible approach is to (tentatively) compensate the computational 
resources spent in deriving the relevance information through many queries (computing 
pieces of relevant information can then be viewed as a form of compilation). 

1.2 Scope of the Paper 

Little is known about the computational complexity of relevance. This paper contributes 
to fill this gap. The complexity of several logic-based relevance relations is identified in a 
propositional setting. By logic-based we mean that the notions of relevance we focus on 
are not extra-logical but built inside the logic: they are defined using the standard logical 
notions of (classical) formula, model, logical deduction, etc. The stress is laid on notions of 
relevance that can prove helpful for improving inference and, in particular, the most basic 
form of it, classical entailment. Relevance is captured by relations in the metalanguage of 
the logic, that is, we formalize relevance as a relation between objects of the propositional 
language (formulas or sets of literals/variables) thus expressing the fact that some object is 
relevant to some other one. 

Two notions play a central role in this paper. The first one, (semantical) formula- 
variable independence (FV-independence for short) tells that a propositional formula S is 
independent from a given set V of variables if and only if it can be rewritten equivalently 
as a formula in which none of the variables in V appears. The second one is the notion 
of forgetting a given set V of variables in a formula S. It is intimately linked to the 
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notion of formula-variable independence because, as we show, the result of forgetting the 
set of variables F in a formula S can be defined as the strongest consequence of S being 
independent from V. Both notions have appeared in the literature under various names 
and with several different (but equivalent) definitions, and are highly useful for many tasks 
in automated reasoning and many problems in artificial intelligence: 

1. automated deduction and consequence finding. Formula- variable independence can be 
useful for checking (un)satisfiability or for consequence finding. Indeed, structuring 
the knowledge base by finding useful independencies may be worth doing before run- 
ning any search algorithm. This principle is at work in (Amir & Mcllraith, 2000). In 
optimal cases, for example, a satisfiability problem will be decomposed into a small 
number of satisfiability problems on easier knowledge bases (with less variables) as 
shown by Park and Gelder (1996). As to improving inference, this can be particu- 
larly helpful in the situation where the set of queries under consideration is limited 
to formulas ip that are (syntactically or semantically) independent from a set V of 
variables. 

2. query answering (see in particular Amir &: Mcllraith, 2000), diagnosis (see the work 
by Darwiche, 1998). Rendering a formula S independent from a set V of variables 
through variable forgetting gives rise to a formula that is query-equivalent to E w.r.t. 
V in the sense that every logical consequence of E that is independent from V 
also is a logical consequence of E once made independent from V, and the converse 
holds as well. Interestingly, the set of all the formulas independent from a set of 
variables is a stable production field (Siegel, 1987; Inoue, 1992), and focusing on 
such a production field is valuable for several reasoning schemes. For instance, in the 
consistency-based framework for diagnosis (Reiter, 1987), the queries we are interested 
in are the conflicts of the system to be diagnosed, i.e., the clauses that are independent 
from every variable used to represent the system, except the abnormality propositions 
used to encode the component failures. 

3. knowledge base structuring, topic-based reasoning. Formula-variable independence is 
a key notion for decomposing a propositional knowledge base (KB for short), i.e., 
a finite set of propositional formulas, into smaller subbases. Such a decomposition 
is all the more valuable as the number of variables the subbases depends on is low. 
Optimally, a knowledge base E = {ipi, is fully decomposable if it can be written 
as E = El U ... U E„ where E^ and Ej depend on disjoint sets of variables for all 
i / j. Such decompositions were considered in several papers (Parikh, 1996; Amir 
Sz Mcllraith, 2000; Marquis Sz Porquet, 2000) with somewhat different motivations. 
The most intuitive motivation for searching such decompositions is that it gives a 
better understanding of the knowledge base, by structuring it with respect to (possible 
disjoint but not necessarily) sets of topics (Marquis Sz Porquet, 2000). 

4. belief revision and inconsistency-tolerant reasoning. Decomposing a propositional 
knowledge base E into subbases {Ei,...,E„} proves also to be relevant for defin- 
ing inconsistency-tolerant relations as well as belief revision operators. The approach 
proposed by Chopra and Parikh (1999) proceeds as follows: the knowledge base E 



393 



Lang, Liberatore, & Marquis 



is first partitioned into such that the intersection of the languages of 

the subbases (i.e., the sets DepVarCEi) of variables the subbases depend on) are as 
small as possible; then ip is inferred from S w.r.t. the given partition if and only if 
the conjunction of all such that DepVar{T,i) f) DepVar{(p) / is consistent and 
entails (p. In a revision situation, this approach also ensures that the only old beliefs 
that may be thrown away are about the variables relevant to the input formula. Fi- 
nally, as recently shown by some of the authors of the present paper, forgetting can be 
advantageously exploited as a weakening mechanism for recovering consistency from 
an inconsistent KB (Lang & Marquis, 2002). 

5. belief update. A belief update operator maps a knowledge base E and an input formula 
ip expressing some explicit evolution of the world to a new knowledge base Soy?; S and 
So(/j respectively represent the agent's knowledge before and after the evolution of the 
world expressed by the update. Several authors argued that an update operator should 
preserve the part of the knowledge base not concerned by the update. This leads to the 
following three-stage process, proposed independently by Doherty, Lukaszewicz, and 
Madalinska-Bugaj (1998) and by Herzig and Rifi (1998) (and named MPMA and wssj,, 
respectively): (i) determine the variables relevant to the update, namely, DepVar{(p); 

(ii) forget these variables in E to obtain a new formula ForgetVar{T,, DepVar{(p)); 

(iii) expand by (p. In more compact terms, this update operator is expressed by 
Yloip = Forgety ar{Tj, DepVar{(p)) A (p. Thus, both our results on FV-independence 
and variable forgetting are relevant to the computational issues pertaining to this kind 
of belief updates. 

6. reasoning about action, decision making, planning. Logical languages for reasoning 
about action express the effects (deterministic or not, conditional or not) of actions by 
means of propositional formulas (Gelfond &; Lifschitz, 1993; Sandewall, 1995; Fargier, 
Lang, & Marquis, 2000; Herzig, Lang, Marquis, &: Polacsek, 2001); finding the vari- 
ables the effects are dependent on enables to identify the variables whose truth value 
may be changed by the action; moreover, formula-literal independence - a refinement 
of FV-independence that we introduce - also tells us in which direction (from false 
to true and/or from true to false) the possible change may occur. This is especially 
useful for filtering out irrelevant actions in a decision making or planning problem. 

7. preference representation. Logical languages for representing preference write elemen- 
tary goals as propositional formulas and it is crucial to identify those variables that 
have no influence on the agent's preference. Therefore, formula-variable independence 
has an important role to play; for instance, the framework of so-called ceteris paribus 
preference statements of Tan and Pearl (1994) interprets a preference item ip : ip > -n/; 
by: for any pair of worlds {oj, lj') such that (i) oj \= tp, (ii) lj' \= -tip and (iii) oj and 
Cjj' coincide on all variables ip and ip are independent from, then we have a strict 
preference of ui over w'. 

1.3 Contribution and Organization of the Paper 

The rest of the paper is structured as follows. After some formal preliminaries given in Sec- 
tion 2, the key notion of formula- variable independence is presented in Section 3. Because it 
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captures a very basic form of logic-based independence, this relation has already been intro- 
duced in the literature under several names, like influenceability (Boutilier, 1994), relevance 
to a subject matter (Lakemeyer, 1997), or redundancy (Doherty et al., 1998). Although it 
is conceptually and technically simple, this notion has not been studied in a systematic way, 
and our very first contribution aims at filling this gap, first by giving several equivalent char- 
acterizations of this notion (this is useful, as several papers introduced and used the same 
concepts under different names), and second by investigating carefully its computational 
complexity. In particular, we show that, in the general case, checking whether a formula is 
independent from a set of variables is coNP-complete. Then, we go beyond this very simple 
notion by introducing the more fine-grained notion of formula/literal independence in order 
to discriminate the situation where a formula S conveys some information about a literal 
but no information about its negation. This refinement is helpful whenever the polarity 
of information is significant, which is the case in many AI fields (including closed- world 
reasoning and reasoning about actions). We also study several interesting notions derived 
from formula- variable and formula-literal independence, such as the notion of simplified for- 
mula (S is Lit- (Var-) simplified if it depends on every literal (variable) occurring in it) and 
the corresponding process of simplifying a formula. Despite this complexity and because 
the size of a simplified formula can never be larger than the size of the original formula, 
simplification can prove a valuable relevance-based preprocessing for improving many forms 
of inference. 

In Section 4, we turn to the second key notion, namely forgetting a given set of variables 
in a formula (Lin & Reiter, 1994). The forgetting process plays an important role in many 
AI tasks and has been studied in the literature under various other names, such as vari- 
able elimination, or marginalization (Kohlas, Moral, &; Haenni, 1999). Several semantical 
characterizations and metatheoretic properties of forgetting are presented. Based on this 
notion, we introduce an additional notion of dependence between two formulas given a set 
of variables: we state that S is equivalent to $ w.r.t. V if and only if both formulas are log- 
ically equivalent once made independent from every variable except those of V . Here again, 
it is important to make a distinction between a variable and its negation (for instance, one 
can be interested in the positive conflicts of a system, only). For this purpose, we introduce 
a notion of literal forgetting that refines the corresponding notion of variable forgetting. We 
show how closed-world inference can be simply characterized from the corresponding equiv- 
alence relation. Finally, we identify the complexity of both notions of equivalence and show 
them to be hard (n2-complete). As a consequence, forgetting variables or literals within a 
formula cannot be achieved in polynomial time in the general case (unless the polynomial 
hierarchy collapses at the first level). We also show that a polysize propositional represen- 
tation of forgetting is very unlikely to exist in the general case. We nevertheless present 
some restricted situations where forgetting is tractable. 

Section 5 shows FV-independence closely related to notions of irrelevance already in- 
troduced in the literature by various authors. Section 6 discusses other related work and 
sketches further extensions of some notions and results studied in the paper. Finally, Sec- 
tion 7 concludes the paper. Proofs of the main propositions are reported in an appendix. 
A glossary of the notations is at the end of this paper, right before the bibliography. 
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2. Preliminaries 

We first recall some basic notions from propositional logic, and from complexity theory. 
2.1 Propositional Logic 

Let PS be a finite set of propositional variables. PROPps is the propositional language 
built up from PS, the connectives and the Boolean constants true and false in the usual 
way. For every V C PS, PROPy denotes the sublanguage of PROPps generated from the 
variables of V only. A literal of PROPy is either a variable of V (positive literal) or the 
negation of a variable of V (negative literal). Their set is denoted Ly, while Ly (resp. Ly) 
denotes the set of positive (resp. negative) literals built up from V. A clause S (resp. a 
term 7) of PROPy is a (possibly empty) finite disjunction (resp. conjunction) of literals of 
PROPy- A CNF (resp. a DNF) formula oi PROPy is a finite conjunction of clauses (resp. 
disjunction of terms) of PROPy. As usual, every finite set of formulas from PROPps is 
identified with the formula that is the conjunction of its elements. 

From now on, S denotes a propositional formula, i.e., a member oi PROPps- Var{T,) is 
the set of propositional variables appearing in E. If L C Lps, Var{L) is the set of variables 
from PS upon which literals of L are built. Elements of PS are denoted v, x, y etc. 
Elements of Lps are denoted li, I2 etc. Subsets of PS are denoted V, X, Y etc. In order 
to simplify notations, we assimilate every singleton V = {v} with its unique element v. The 
size of a formula E, denoted by is the number of occurrences of variables it contains. A 
propositional formula S is said to be in Negation Normal Form (NNF) if and only if only 
propositional symbols are in the scope of an occurrence of in S. It is well-known that 
every propositional formula E built up from the connectives A, V, -■, ^ only, can be turned 
in linear time into an equivalent NNF formula by "pushing down" every occurrence of in 
it (i.e., exploiting De Morgan's law) and removing double negations (since is involutive). 
Slightly abusing words, we call the formula resulting from this normalization process the 
NNF of S and we note Lit{Y,) the set of literals from Lps occurring in the NNF of S. For 
instance, the NNF of S = -i((-iaA6) Vc) is (aV-16) A-ic; hence, we have L«i(S) = {a, -^h, -ic}. 
Note that the NNF of a formula depends on its syntactical structure, i.e., two formulae that 
are synctatically different may have different NNFs even if they are equivalent. 

Full instantiations of variables of F C PS are called V-worlds^; they are denoted by 
u}y and their set is denoted Jly. When A and B are two disjoint subsets of PS, uja A ojp 
denotes the A U 5-world that coincides with uja on A and with uop oto. B. An interpretation 
(jj over PROPps is just a PS-world, and w is said to be a model of S whenever it makes S 
true. We denote Mo(i(S) the set of models of S. 

For every formula E and every variable x, S^j^o (resp. Yl^-i^i) is the formula obtained 
by replacing every occurrence of in E by the constant false (resp. true). E^^i (resp. 
E;<_o) is an abbreviation for Tix^-i (resp. Ea;^o) when / is a positive literal x and for E^j^o 
(resp. Ejj^i) when I is a negative literal -^x. 

Given an interpretation w and a literal we let Force{Lj,l) denote the interpretation 
that gives the same truth value as w to all variables except the variable of I, and such 
that Force{u),l) \= I. In other words, Force{u),l) is the interpretation satisfying / that is 

1. The 1^-worlds are also called partial models over V . 
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the closest to uj. For instance, provided that PS = {a J)} and a;(a) = a;(6) = 1, we have 
Force{uj,-'b){a) = 1 and Force{uj,-'b){b) = 0. Clearly, if lj \= I then Force{LjJ) = lj. If 
L = {li, . . . , In} is a consistent set of literals, then Force{LO, L) is defined as 

Force{. . . {Force{LJ, /i), . . .), /„). 

Lastly, given an interpretation lj and a variable x, we let Switch{(jj, x) denote the interpre- 
tation that gives the same truth value as a; to all variables except x, and that gives to x 
the value opposite to that given by lj. 

Two formulas * and $ are said to be equivalent modulo a formula S if and only if 
S A * = S A 

In this paper we use the concepts of prime implicates and prime implicants. The set of 
prime implicates of a formula S, denoted by /P(S), is defined as: 

/P(S) = {6 clause | S |= 5 and ^6' clause s.t. S |= 6' and 6' ^ 6 and 6 ^ 6'}. 

Among all the implicates of S (i.e., the clauses entailed by S), the prime implicates 
of S are the minimal ones w.r.t. |= (i.e., the logically strongest ones). The set of prime 
implicants of a formula S, denoted by P/(E), is defined dually as: 

P/(S) = {7 term | 7 |= S and jBj' term s.t. 7' |= S and 7 |= 7' and 7' ^ 7}. 

Among all the implicants of S (i.e., the terms implying E), the prime implicants of S 
are the maximal ones w.r.t. |= (i.e., the logically weakest ones). 

Of course, the set of prime implicants/ates may contain equivalent terms/clauses. We 
can restrict our attemption to one term/clause for each set of equivalent terms/clauses. 
Stated otherwise, in P/(S) and /P(S), only one representative per equivalence class is 
kept. 

Example 1 Let T, = {a \/ b, A c ^ e, d 4^ e} . The set of prime implicates of S is, by 
definition, 

/P(S) = {a V 6, a V V e, ^c? V e, c? V ^e, a V V d}. 
2.2 Computational Complexity 

The complexity results we give in this paper refer to some complexity classes which deserve 
some recalls. More details can be found in Papadimitriou's (1994) textbook. Given a 
problem A, we denote by A the complementary problem of A. We assume that the classes 
P, NP and coNP are known to the reader. The following classes will also be considered: 

• BH2 (also known as DP) is the class of all languages L such that L = Li n L2, where 
Li is in NP and L2 in coNP. The canonical BH2-complete problem is SAT-UNSAT: a 
pair of formulas {<p,ip) is in SAT-UNSAT if and only if (p is satisfiable and tp is not. 
The complementary class C0BH2 is the class of all languages L such that L = Li U L2, 
where Li is in NP and L2 in coNP. The canonical coBH2-complete problem is SAT- 
OR-UNSAT: a pair of formulas {ip, tp) is in SAT-OR-UNSAT if and only if ip is satisfiable 
or ip is not. 
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• A2 = P'^'' is the class of all languages recognizable in polynomial time by a deter- 
ministic Turing machine equipped with an NP oracle, i.e., a device able to solve any 
instance of an NP or a coNP problem in unit time. FA2 is the corresponding class of 
function problems. 

• S2 = NP'^'^ is the class of all languages recognizable in polynomial time by a nonde- 
terministic Turing machine equipped with an NP oracle. The canonical S2-complete 
problem 2-QBF is the set of all triples {A = {ai, ...,«„}, S = {61, where 
A and B are two disjoint sets of propositional variables and $ is a formula from 
PROPaub- a positive instance of this problem is a triple {A, B, for which there 
exists a ^4- world lja such that for all S-world ljb have A |= ^■ 

• 112 = C0II2 = coNP'^'^. The canonical n2-complete problem 2-QBF is the set of all 
triples {A = {ai, 0^}) -B = {61, where A and B are two disjoint sets of 
propositional variables and $ is a formula from PROPaub- A positive instance of this 
problem is a triple {A,B,^) such that for every y4-world uja there exists a S-world 
ljb for which lja A ljb \= ^■ 

E2 and 112 complexity classes located at the so-called second level of the polynomial 
hierarchy, which plays a prominent role in knowledge representation and reasoning. 

3. Formula-Literal and Formula- Variable Independence 

Formula-literal and formula-variable independence capture some forms of independence 
between the truth values of variables and the possible truth values of a formula. Roughly 
speaking, S is dependent on / because it tells one something positive about more precisely, 
there is a contest, i.e., a conjunction of literals, that, added to to S enables one to infer 
I. For instance, S = (a =^ 6) is dependent on b, since b can be inferred from E under the 
assumption that a is true. Of course, we cannot assume contexts that are inconsistent with 
E, and we cannot assume / itself. A formula E will be considered as independent from 
variable x if and only if it is both independent from x and independent from -^x. Dually, 
we can interpret both forms of dependence as aboutness relations; when S is dependent on 
a literal I, it tells one something about I, and when E is dependent on a variable x, it tells 
something about x or about -^x. 

3.1 Syntactical Independence 

The easiest way to define dependence between a formula E and a literal I is by assuming 
that E, when put into NNF, contains I. Reminding that Lii(E) is the set of literals that 
occur in the NNF of E, this can be formally expressed by the following definition: 

Definition 1 (syntactical FL-independence) Let Yl be a formula from PROPps, I a 
literal of Lps, and L a subset of Lps- 

• E is said to be syntactically Lit-dependent on I (resp. syntactically Lit-independent 
from I) if and only if I G Lit{'E) (resp. I ^ Lit{T,)). 
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• Ti is said to be syntactically Lit-dependent on L if and only if there is a I & L such 
that S is syntactically Lit-dependent on I. Otherwise, S is said to be syntactically 
Lit-independent from L. 

Prom this definition it follows immediately that E is syntactically Lit-independent from 
L if and only if L«t(E) fl L = 0. Thus, in order to determine whether a formula S is 
syntactically Lit-dependent on a set L of literals, it suffices to check whether there exists a 
literal in L that occurs in the NNF of S. 

Example 2 Let ^ = -^{a A h). The NNF of $ is (-■« V ^h); therefore, $ is syntactically 
Lit-dependent on both -la and -16, while it is syntactically Lit-independent from a and h. 

As this example illustrates, a propositional formula can easily be syntactically Lit- 
independent from a literal while syntactically Lit-dependent on its negation. This is as 
expected, since S may imply / in some context, while it may be always impossible to derive 
-1/. In other words, the set of literals S is syntactically Lit-independent from is not closed 
under negation. Interestingly, a notion of syntactical formula- variable independence can be 
defined from the more basic notion of syntactical FL-independence. 

Definition 2 (syntactical FV-independence) Let he a formula from PROPps, v a 
variable of PS, and V a subset of PS. 

• E is said to be syntactically Var-dependent on v (resp. syntactically Var-independent 
from v) if and only if v € Far(S) (resp. v Var{T,)). 

• T, is said to be syntactically Var-dependent on V if and only if there is a variable v 
in V s.t. S is Var-dependent on it, i.e., if and only if Var{Yl) ClV ^ Otherwise, S 
is said to be syntactically Var-independent from V. 

Example 3 Let S = (a A -16). S is syntactically Var-dependent on a and on b and syntac- 
tically Var-independent from c. 

Syntactical FV-independence can be easily expressed as syntactical FL-independence: 
S is syntactically Var-independent from V if and only if it is syntactically Lit-independent 
from V U {-la; | x G V}. Clearly enough, both syntactical formula-literal independence and 
syntactical formula-variable independence can be checked in linear time. 

However, these basic forms of independence suffer from two important drawbacks. First, 
they do not satisfy the principle of irrelevance of syntax: two equivalent formulas are 
not always syntactically independent from the same literals/variables. Second, syntacti- 
cal dependence does not always capture the intuitive meaning of dependence: for instance, 
S = (a A -16 A (a V b)) is syntactically Lit-dependent on a, -16, b; since -16 can be derived 
from S, S is about -16 in some sense. Contrastingly, there is no way to derive b from S, 
unless producing an inconsistency. 

Handling such a separation requires a more robust notion of independence, to be intro- 
duced in the following section. 
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3.2 Semantical Independence 

We now give a semantical definition of independence, wfiicfi does not suffer from the afore- 
mentioned drawbacks, i.e., a definition that does not depend on the syntactical form in 
which formulas are expressed. We will prove that this semantical definition of indepen- 
dence does not suffer from the second drawback of syntax independence, i.e., a formula 
that is semantically dependent on a literal always enables one to derive the literal in some 
context. 

Definition 3 ((semantical) FL-independence) LetH be a formula from PROPps, I G 
Lps, and L a subset of Lps- 

• T, is said to be Lit-independent^ from I, denoted I ^ T,, if and only if there exists a 
formula $ s.i. $ = S and $ is syntactically Lit-independent from I. Otherwise, S is 
said to be Lit-dependent on I, denoted I E. The set of all literals of Lps such that 
I i-^ T, is denoted by DepLit{T,) . 

• S is said to be Lit-independent from L, denoted Lt/^Y^, if and only if LnDepLit{Yl) = 
0. Otherwise, S is said to be Lit-dependent on L, denoted L T,. 

Simply rewriting the definition, E is Lit-independent from L if and only if there exists a 
formula <I? s.t. <l> = S and $ is syntactically Lit-independent from L. Thus, FL-independence 
is not affected by the syntactic form in which a formula is expressed, that is, replacing S with 
any of its equivalent formulas does not modify the relation i-^. Since S is Lit-independent 
from L if and only if E can be made syntactically Lit-independent from L while preserving 
logical equivalence, it follows that syntactical Lit-independence implies Lit-independence, 
but the converse does not hold in the general case. 

Example 4 Xei E = (a A -.6 A (a V b)). We have DepLit{T) = {a, -.6). Note that E is 
Lit-independent from b because it is equivalent to ^ = {a A -^b), in which b does not appear 
positively. 

As in the case of syntactical independence, we can formalize the fact that a formula E 
has some effects on the truth value of a variable v. Indeed, we define a notion of (semantical) 
formula- variable independence, which can also be easily defined from the (semantical) notion 
of FL-independence. 

Definition 4 ((semantical) FV independence) LetTi be a formula from PRO Pps, v G 
PS, and V a subset of PS. 

• E is said to be Var-independent from v, denoted v E, if and only if there exists a 
formula $ s.t. $ = E and $ is syntactically Var-independent from v. Otherwise, E is 
said to be Var-dependent on v, denoted v h^I E. We denote by DepVar{T,) the set 

of all variables v sucli that v E. 

2. In order to avoid any ambiguity, wc will refer to the syntactical forms of independence explicitly from 
now on; in other words, independence must be taken as semantical by default in the rest of the paper. 
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• Ti is said to be Var-independent from V, denoted V S, if and only if V r\ 

DepVar{T,) = 0. Otherwise, S is said to be Var-dependent on V , denoted V S. 

Clearly, S is Var-independent from V if and only if there exists a formula $ s.t. $ = E 
and $ is syntactically Var-independent from V . Moreover, Var-independence is to Lit- 
independence as syntactical Var-independence is to syntactical Lit-independence; indeed, S 
is Var-independent from V if and only if S is Lit-independent from Ly- 

Example 5 Let S = (a A -ife A (a V c)). We have DepVar{'S) = {a,b}. Note that S is 
Var-independent from c. 

The definition of semantical FL-independence is based on the set of literals of formulas 
equivalent to S. Intuitively, this is the easiest way to define a notion of independence that 
is not dependent on the syntax. However, proving theorems directly from this definition 
is not so easy. For instance, we will prove that determining whether a formula S is Lit- 
dependent on literal / is in NP, but this result cannot be directly proved from Definition 
3, since checking all possible formulas equivalent to S cannot be done with a polynomial 
non-deterministic guessing. We give now a semantical characterization of FL-independence. 

Proposition 1 A formula S is Lit-independent from literal I if and only if, for any inter- 
pretation LJ G 0,ps, if LJ \= Ti then Force{uj, -■/) |= E. 

As a direct corollary, we get that S is Lit-independent from L if and only if for any 
literal / E L and any interpretation cj G ^ps, if w |= S then Force{cu. -■/) |= S. 

This property gives an idea of how FL-dependence works. Indeed, if S is Lit-dependent 
on a literal then there exists an interpretation a; such that w |= E and Force{u), -■/) ^ E, 
which means that (a) uj \= I and (b) the literal Hn w is "really needed" to make uj a model 
of S, that is, the partial interpretation obtained by removing / in oj does not satisfy S. 

This property also explains why FL-dependence formalizes the concept of "true in some 
context" , as explained at the beginning of this section. Indeed, S is Lit-dependent on / if 
and only if there is some context (consistent with E, and that does not imply /), in which 
S implies /. This can be proved from the above proposition: if a; is an interpretation such 
that w 1= S but Force{ijj, -■/) ^ S, then the term: 

= ^{{x G PS I w 1= S A a;} U {^x \ x e PS and w |= E A ^2;})\{/} 

is consistent with S (by construction, 7 is equivalent to the disjunction of a term equivalent 
to u) with a term equivalent to Force{u), -■/)); it also holds that 7 A S |= while 7 ^ that 
is, 7 is a context in which S implies I. 

Moreover, Proposition 1 shows that our notion of Lit-independence coincides with the 
notion of (anti-)monotonicity (Ryan, 1991, 1992). To be more precise, a (consistent) formula 
is said to be monotonic (resp. antimonotonic) in variable v if and only if it is Lit-independent 
from -^v (resp. from v). Interestingly, a subclassical^ inference relation, called natural 
inference^ has been defined on this ground (Ryan, 1991, 1992). Basically, a formula 
is considered as a consequence of a formula S if and only if it is a logical consequence 

3. That is, a restriction of classical entailment |=. 
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of it, and S Lit-independence of a literal implies its Lit-independence. Accordingly, 
natural inference prevents us from considering p V g as a consequence of p (it is a relevant 
implication). All our characterization results about Lit-independence, including complexity 
results, have an immediate impact on such a natural inference relation (especially, they 
directly show that the complexity of natural inference is in A2). 

Proposition 1 is easily extended to formula-variable independence: 

Corollary 1 A formula S is Var-independent from variable v if and only if, for any 
interpretation u) G ^ps, we have a; |= S i/ and only if Switch{(jj,v) |= E. 

The following metatheoretic properties of FL-independence are used in the rest of this 
section. 

Proposition 2 

(1) DepLit{i:) C L'it(S); 

(2) IfT. = then DepLit{^) = DepLit{^); 
(3a) DepLit{T. A $) C DepLit{T) U DepLit{^); 
(3b) DepLit{E V C DepLit{E) U DepLit{^); 
(4) I ^ DepLitCS) if and only if ^l G DepLit{-^'T). 

FV-independence exhibits similar properties, plus negation stability (point (4) below), 
which is not satisfied by FL-independence. 

Proposition 3 

(1) DepVar{T.) C Far(S); 

(2) IfY. = ^, then DepVar{T,) = DepVar{<^>); 
(3a) DepVar{T, A $) C DepLit{T,) U DepLit{^); 
(3b) DepVar{T. V <!>) C DepLit{J:) U DepLit{^); 
(4) DepVar{-^T,) = DepVar{T,). 

Beyond these properties, FL-independence and FV-independence do not exhibit any 
particularly interesting structure. In particular, FL-independence and FV-independence 
neither are monotonic nor anti-monotonic w.r.t. expansion of E (strengthening or weakening 
E can easily make it no longer independent from a set of literals or variables). 

Recall that L 1-^ S if and only ii LriDepLit{Yl) / 0. This means that the Lit-dependence 
of S on L only implies the Lit-dependence on a literal of L, not a "full" Lit-dependence on 
any literal of L. In other words, if we want to check whether a formula S is Lit-dependent on 
any literal of L, we need a notion stronger than FL-dependence, called full FL-dependence. 
The same can be said for FV-dependence. 

Definition 5 (full FL/FV-dependence) Let S be a formula from PROPps, L be a sub- 
set of Lps and V be a subset of PS. 

• S is fully Lit-dependent on L if and only if L C DepLit{T,). 

• H is fully Var-dependent on V if and only if V C DepVar{T,). 
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Example 6 S = (aA-i6A(6V-i6)) is fully Lit- dependent on {a, -16} and fully Var-dependent 
on {a,b}. Contrastingly, S is not fully Lit-dependent on {a,b}. 

Clearly enough, whenever S is fully Lit-dependent on L, it is also fully Var-dependent 
on Var{L). However, the converse does not hold, as the previous example shows. 

While full FL-dependence (resp. full FV-dependence) can be checked in linear time 
once DepLit{T,) (resp. DepVar{T,j) is known, at the end of the section we prove that 
determining these sets is NP-hard, and that deciding full FL-dependence (as well as full 
FV-dependence) is NP-complete. 

Let us now consider the particular case of full FL-dependence when L = Lit{Tj), or the 
full FV-dependence when V = Far(E). If full dependence holds in these cases, we say 
that S is Lit-simplified, or Var-simplified, respectively. Var-simplification is achieved when 
E contains no occurrence of any variable it is Var-independent from. Lit-simplification 
corresponds to the more restricted situation where the NNF of E does not contain any 
occurrence of a literal it is Lit-independent from. 

If a formula E is not Lit-simplified (resp. Var-simplified), then there is some literal (resp. 
variable) that occurs in the NNF of E, but E is not Lit-dependent (resp. Var-dependent) on 
it. This means that the syntactic form in which E is expressed contains a literal or variable 
that is indeed useless. 

Definition 6 (simplified formula) Let H be a formula from PROPps- 

• Tj is Lit-simplified if and only if Lit{T,) = DepLit{T,). 

• H is Var-simplified if and only i/Far(S) = DepVarCS). 

As the following example illustrates, every formula that is Lit-simplified also is Var- 
simplified but the converse does not hold in the general case. 

Example 7 E = (a A -16 A (6 V -16) A (a Vc)) neither is Lit-simplified nor Var-simplified. The 
equivalent formula (a A -16 A (6 V -16)) is Var-simplified but it is not Lit-simplified. Finally, 
the equivalent formula {a A -16) is both Lit-simplified and Var-simplified. 

Simplified formulas do not incorporate any useless literals or variables. As the next 
proposition shows it, the notion of simplified formula actually is the point where syntactical 
independence and (semantical) independence coincide. 

Proposition 4 Let H be a formula from PROPps. 

• E is Lit-simplified if and only if the following equivalence holds: for every L C Lps, 
E is syntactically Lit-independent from L if and only if L i/^ T, holds. 

• E is Var-simplified if and only if the following equivalence holds: for every V C PS, 
E is syntactically Var-independent from V if and only if V S holds. 

Thus, while a formula can easily be Lit-independent from a set of literals without be- 
ing syntactically Lit-independent from it, simplification is a way to join Lit-independence 
with its syntactical restriction (which is easier to grasp, and as we will see soon, easier 
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to check in the general case): Lit-independence and syntactical Lit-independence coincide 
on Lit-simplified formulas. The same holds for Var-independence and syntactical Var- 
independence. 

The strength of the notion of simplification lies in the fact that every formula can be 
simplified preserving its models. This is useful, as simplified formulas can be shorter and 
easier to understand than formulas containing useless literals. 

Proposition 5 For every S from PROPps, there exists a Lit-simplified formula $ s.t. 
E = 

Since Lit-simplified KBs are also Var-simplified, this proposition also shows that every 
KB can be Var-simplified without modifying its set of models. 

Interestingly, both FL-independence and FV-independence can be characterized with- 
out considering the corresponding syntactic notions of independence, just by comparing 
formulas obtained by setting the truth value of literals we want to check the dependence. 

Proposition 6 Let T, be a formula from PROPps and I he a literal of Lpg- The next four 
statements are equivalent: 

(1) I ^ E; 

(2) S^^i 1= s,^o; 

(3) s 1= S;^o; 

(4) 1= S. 

The above properties can be used to check whether a formula is Lit-dependent on a 
literal, as the following example shows. 

Example 8 S = (a A -ife A (6 V is Lit-independent from b since replacing b by true 
within S gives rise to an inconsistent formula. Contrastingly, S is Lit-dependent on -i6 
since replacing b by false within S gives S6<_o = a, which is not at least as logically strong 
as the inconsistent formula obtained by replacing b by true within E. 

A similar proposition holds for FV-independence, characterizing the variables a formula 
S depends on, using the formulas Tix-i-o and Tix-i-i- 

Proposition 7 Let T, be a formula from PROPps and x be a variable of PS. The next 
four statements are equivalent: 

(1) X S; 

(2) T^x^Q = ^x^l', 

(3) S = Sa^^o; 

(4) S = '^x<-i- 

As in the case of literal dependence, the above property can be used to find out whether 
a formula is Var-dependent on a variable. 

Example 9 S = (aA (6V-i6)) is Var-independent from b since we have S^^q = Sh^i = a. 
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Interestingly, FL-independence and FV-independence can be determined in an effi- 
cient way when E is given in some specific normal forms, namely, prime implicate nor- 
mal form or prime implicant normal form. For such normal forms, Lit-independence and 
Var-independence come down to their corresponding syntactical forms. 

Proposition 8 Let Tj be a formula from PROPpg and L be a subset of Lpg- The next 
statements are equivalent: 

(1) L S; 

(2) PI{Ti) C {7 I 7 is a term that does not contain any literal from L}; 

(3) IP{Y1) C {6 \ 6 is a clause that does not contain any literal from L}. 

Proposition 9 Let H be a formula from PROPps and V be a subset of PS. The next 
statements are equivalent: 

(1) V S; 

(2) PI{Ti) C {7 I 7 is a term that does not contain any variable from V}; 

(3) IP{^) C {5 I 5 is a clause that does not contain any variable from F}. 

Example 10 Let 12 = (a A^6A (6V^6)). WehavePI{^) = {{aA^b)} andIP{T,) = {a,^b} 
(up to logical equivalence). We can easily observe that S is Lit- independent from b looking 
at -P/(S) (or IP{Ti)). We can also easily state that S is Lit-dependent on -^b and Var- 
independent from c by considering any of these normal forms. 

Proposition 7 shows that a KB can be easily Var-simplified (i.e., in polynomial time) 
as soon as the variables it is Var-independent from have been determined. Indeed, we can 
easily design a greedy algorithm for simplifying KBs. This algorithm consists in considering 
every variable x of Far(S) in a successive way, while replacing S by Sa;^o whenever S is 
Var-independent from x. This algorithm runs in time polynomial in the size of E once the 
variables S is Var-independent from have been computed. At the end, the resulting KB is 
Var-simplified. 

Lit-simplifying a KB is not so easy if no assumptions are made about its syntax, due 
to the fact that literals signs are crucial here. Indeed, looking only at the occurrence of a 
variable inside a formula is not sufficient to state whether or not this is a positive occurrence 
(or a negative one). Fortunately, turning a formula into its NNF is computationally easy (as 
long as it does not contain any occurrence of connectives like -44> or ®) and it proves sufficient 
to design a greedy algorithm for Lit-simplifying a KB when the literals it is Lit-independent 
from have been identified. Indeed, within an NNF formula, every literal can be considered 
easily as an atomic object. This algorithm consists in considering every literal I of Lit{T,) 
in a successive way, while replacing every occurrence of / in S by false considering every 
literal of Lit{Yj) as an atom. Stated otherwise, when / is a positive (resp. negative) literal 
X (resp. -ix), replacing / by false does not mean replacing -^x (resp. x) by true: only the 
occurrences of I with the right sign are considered. This algorithm runs in time polynomial 
in the size of S once the literals S is Lit-independent from have been computed. At the 
end, the resulting KB is Lit-simplified. 
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3.3 Complexity Results 

While syntactical FL and FV- dependence can be easily checked in linear time in the 
size of the input, this is far from being expected for (semantical) FL-dependence and FV- 
dependence in the general case: 

Proposition 10 (complexity of FL/FV-dependence) FL DEPENDENCE, FV depen- 
dence, FULL FL DEPENDENCE and FULL FV DEPENDENCE are NP-complete. 

Thus, although they look simple, the problems of determining whether a formula is 
independent from a literal or a variable are as hard as checking propositional entailment. 
Interestingly, the complexity of both decision problems fall down to P whenever checking 
(in) dependence becomes tractable. Apart from the case of syntactical independence, some 
other restrictions on S makes (in) dependence testable in polynomial time. Especially, we 
get: 

Proposition 11 Whenever E belongs to a class C of CNF formulas that is tractable for 
clausal query answering (i.e., there exists a polytime algorithm to determine whether S |= 7 
for any CNF formula j) and stable for variable instantiation (i.e., replacing in T, E C any 
variable by true or by false gives a formula that still belongs to C) then FL dependence, 
FV dependence, FULL FL DEPENDENCE and FULL FV DEPENDENCE are in P. 

In particular, when S is restricted to a renamable Horn CNF formula or to binary 
clauses (Krom formula), all four decision problems above belong to P. 

We have also investigated the complexity of checking whether a formula is Lit-simplified 
(and Var-simplified) : 

Proposition 12 LiT-siMPLiFiED FORMULA and Var-simplified formula are NP-complete. 

All these complexity results have some impact on the approaches that explicitly need 
computing DepVar{Il) as a preprocessing task. Namely, we have the following result: 

Proposition 13 

1. Determining whether DepLitiY,) = L (where L is a set of literals), and determining 
whether DepVariJ^) = X (where X is a set of variables) is BH2-complete. 

2. The search problem consisting in computing DepLit{Yj) (respectively DepVar{T,)) is 
in FA2 and is both NP-hard and coNP-hard. 

3.4 Discussion 

The previous characterizations and complexity results lead to several questions: when is 
it worthwhile to preprocess a knowledge base by computing independence relations? How 
should these independence relations be computed? What is the level of generality of the 
definitions and results we gave in this section? 
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Many equivalent characterizations of formula- variable independence have been given in 
the literature, each of which could serve as a definition (Definition 4, Corollary 1, each of the 
statements (2), (3) and (4) in Proposition 7 and of the statements (2), (3) in Proposition 9), 
so one may wonder which one has to be used in practice. 

In many papers referring explicitly to formula-variable independence, the prime impli- 
cant/cate characterization (Proposition 9) is used as a definition (Boutilier, 1994; Doherty 
et al., 1998). Generally speaking, this is not the cheapest way to compute the set of vari- 
ables a formula depends on, since the size of -PI(S) is exponential in the size of E in the 
worst case. This characterization is to be used in practice only if the syntactical form of 
S is such that its prime implicants or prime implicates can be computed easily from it 
(this is the case for instance whenever E is a Krom formula). Clearly, the cheapest way to 
compute formula- variable independence consists in using any of the equivalent formulations 
of Proposition 7, which all consist of validity tests. 

Checking whether a formula S is Var-independent from a variable x is coNP-complete, 
which implies that simplifying a knowledge base by getting rid of redundant variables needs 
|Far(E)| calls to an NP oracle, and is thus in FA2 and not below (unless NP = coNP). This 
may look paradoxical (and sometimes useless) to preliminarily compute several instances 
of a NP or coNP-hard independence problem to help solving a (single) instance of a NP 
or coNP-complete problem. However, this negative comment has a general scope only, 
and in many particular cases, this can prove quite efficient (indeed, even when S has no 
particular syntactical form, the satisfiability or the unsatisfiability of T,x<^o A -''Ex<^i may 
be particularly easy to check). Furthermore, if the knowledge base S is to be queried many 
times, then the preprocessing phase consisting in Var-simplifying E by ignoring useless 
variables is likely to be worthwhile. 

4. Forgetting 

In this section, we define what forgetting is, present some of its properties, and finally give 
some complexity results. 

4.1 Definitions and Properties 

A basic way to simplify a KB w.r.t. a set of literals or a set of variables consists in forgetting 
literals/variables in it. Beyond the simplification task, forgetting is a way to make a formula 
independent from literals/variables. Let us first start with literal forgetting: 

Definition 7 (literal forgetting) Let T, be a formula from PROPps and L he a subset 
of Lps- For get Lit (Ti, L) is the formula inductively defined as follows: 

1. ForgetLtt{J^,$) = S, 

2. ForgetLit{T,, {I}) = V {-^l A S), 

3. ForgetLit{T., {1} U L) = ForgetLit{ForgetLit{T., L), {/}). 
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This definition is sound since the ordering in which hterals of L are considered does 
not matter^. We can also prove that the definition above is equivalent to the one in which 
Point 2. is replaced by 

ForgetLiti^, {I}) = V A E;^o) 

just because S and Si<_o are equivalent modulo -iL 

Let us now give a semantical characterization of literal forgetting. If L = {/}, that is, L 
is composed of a single literal, then forgetting I from a formula S amounts to introducing 
the model Force{uj, -^l) for each model a; of E such that u \=l. 

Proposition 14 The set of models of ForgetLit{Yl, {I}) can be expressed as: 

Mod{ForgetLit{T., {I})) = Mod{T.) U {Force(w, -^l) | w |= S} 

= {a; I Force{u!, I) |= S} 

A similar statement can be given for the case in which L is composed of more than one 
literal. In this case, for each model uj of E, we can force any subset of literals Li C L such 
that CjO 1= Li to assume the value false. 

Proposition 15 The set of models of ForgetLit{Ti, L) can he expressed as: 
Mod{ForgetLit{T,,L)) = {u; \ Force{u),Li) \= E where Li C L} 
As a corollary, we obtain the following properties of literal forgetting: 

Corollary 2 Let S, $ be formulas from PROPpg and ^1,^2 ^ Lpg- 

• E \= ForgetLit{'>l,Li) holds. 

• 7/ S 1= $ holds, then ForgetLit{Y,, Li) |= ForgetLit{^, Li) holds as well. 

• If Li C L2 holds, then ForgetLit{T,,Li) \= ForgetLit{I], L2) holds as well. 
Let now consider an example. 

Example 11 Let E = (^a V 6) A (a V c). We have ForgetLit{T,, {^a}) = (a V c) A (6 V c). 

The key proposition for the notion of forgetting is the following one: 

4. The proof of this statement is almost straightforward: let h, I2 be two literals; 

1. if h I2 and h then ForgetLit{ForgetLit{T.,h),h) = V (-./i A S);^^! V {^h A 
E(i<_i) V (-1/1 A A S) = S(i^i,(2^i V (-1/1 A Sj^^i) V {^h A V (-1/1 A -1Z2 A S) is symmetric 
in h and h; 

2. if I2 = = then FoTgetLit{ForgetLit{T., I), = (E(<_i V (-./ A E(<_o))(<-o V (/ A (E(^i V (-./ A 
Ei<_o))i<-o) = Ei<_i V Ei<_o is symmetric in / and -■/; 

3. the case h = I2 is trivial. 
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Proposition 16 Let H be a formula from PROPps ond L C Lps- ForgetLit{Ti,L) is the 
logically strongest consequence of E that is Lit-independent from L (up to logical equiva- 
lence). 

The following immediate consequence of Proposition 16 establishes strong relationships 
between literal forgetting and Lit-independence: 

Corollary 3 Let T, be a formula from PROPps and L C Lps- is Lit-independent from 
L if and only ifT, = ForgetLit{Yl, L) holds. 

The following one gives an immediate application of literal forgetting: 

Corollary A If a formula ip is Lit-independent from L, then S |= </? holds if and only if 
ForgetLit{T,, L) |= (p. 

This result proves that forgetting literals from L does not affect entailment of formulas 
that are Lit-independent from L. This is in some sense analogous to the concept of filtration 
in modal logics (Goldblatt, 1987); indeed, if we are interested in knowing whether T, \= ip 
only for formulas ip that are Lit-independent from L, then the literals of L can be forgotten 
in S. 

Let us now investigate the computation of ForgetLit{T,,L). Let us first consider DNF 
formulas S. Forgetting literals within DNF formulas is a computationally easy task. On the 
one hand, forgetting literals within a disjunctive formula comes down to forgetting them in 
every disjunct: 

Proposition 17 Let S, $ be two formulas from PROPps and L C Lps- 
ForgetLit{E V L) = ForgetLit{T,, L) V ForgetLit{^, L). 

On the other hand, forgetting literals within a consistent term simply consists in remov- 
ing them from the term: 

Proposition 18 Let^ be a consistent term from PROPpg (viewed as the set of its literals) 
and L C Lps- ForgetLit{'y, L) = /\iej\L^- 

Combining the two previous propositions shows how literals can be forgotten from a 
DNF formula S in polynomial time. It is sufficient to delete every literal of L from each 
disjunct of S (if one of the disjuncts becomes empty then ForgetLit{Tj, L) = true). 

Things are more complicated for conjunctive formulas SA$. Especially, there is no result 
similar to Proposition 17 for conjunctive formulas. While ForgetLit{Yl, L)AForgetLit{^, L) 
is a logical consequence oi ForgetLit{T,A^,L) (see Corollary 2), the converse does not hold 
in the general case. 

Example 12 Let S = a, $ = -la, and L = {a}. Since a i/> <&, we have ForgetLit{^, L) = 
Since ForgetLit{T,, L) is valid, we have {ForgetLit{T,, L) A ForgetLit{^, L)) = -la. 
Since S A $ is inconsistent, ForgetLit{Ti A ^,L) is inconsistent as well. 
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Clearly enough, any non- valid clause S is Lit-independent from L if and only if Lit{5) fl 
L = 0. Since the conjunction of two formulas that are Lit-independent from L is Lit- 
independent from L (see Proposition 2), and since every formula is equivalent to a CNF 
formula. Proposition 16 shows ForgetLit{T,, L) equivalent to the set of all clauses 6 that are 
entailed by S and are from {6 clause | Lit{6)riL = 0}. Because {6 clause | Lit{6)riL = 0} is 
closed under subsumption (i.e., it is a stable production field), it is possible to take advantage 
of consequence finding algorithms (see Marquis, 2000), to derive a CNF representation 
of ForgetLit{T,, L). Especially, in the case where E is a CNF formula, resolution-based 
consequence finding algorithms like those reported in (Inoue, 1992) or (del Val, 1999) can 
be used; this is not very surprising since resolution is nothing but variable elimination. 

In contrast to the disjunctive formulas situation, there is no guarantee that such consequence- 
finding algorithms run in time polynomial in the input size when the input is a conjunctive 
formula (otherwise, as explained in the following, we would have P = NP). Nevertheless, 
forgetting literals within a conjunctive formula S can be easy in some restricted cases, es- 
pecially when S is given by the set of its prime implicates; in this situation, it is sufficient 
to give up those clauses containing a literal from L. 

Proposition 19 Let T, be a formula from PROPps and L C Lps- 

IP{ForgetLit{^, L)) = {S \ S e /P(S) and Lit{S) n L = 0}. 

The notion of literal forgetting generalizes the notion of variable elimination from prepo- 
sitional logic (that was already known by Boole as elimination of middle terms and has been 
generalized to the first-order case in a more recent past by Lin & Reiter, 1994). Indeed, 
variable elimination is about variable forgetting, i.e., the one achieved not considering the 
literals signs: 

Definition 8 (variable forgetting) Let be a formula from PROPps and let V be a 
subset of PS. ForgetVar{'S,V) is the formula inductively defined as follows: 

• ForgetVar{E,$) = E, 

• ForgetVar{T,, {x}) = E^j^i V S^j^o, 

• ForgetVar{T., {x} U F) = ForgetVar{ForgetVar{^, F), {x}). 

Example 13 Let S = (^a V 6) A (a V c). We have ForgetVariT,, {a}) = (6 V c). 

As a direct consequence of the definition, ForgetVar{Ti, {xi, ...,Xn}) is equivalent to the 
quantified boolean formula (usually with free variables!) denoted 3xi . . . 3xn^. 

Clearly enough, forgetting a variable x amounts to forgetting both the literals x and 

Proposition 20 Let H be a formula from PROPps and V C PS. We have 

ForgetVar{T.,V) = ForgetL'it{E, Ly) 

This result, together with the previous results on literal forgetting, gives us the following 
corollaries for variable forgetting: 
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Corollary 5 

Mod{ForgetVar{T., {x})) = Mod{T.) U {Switch{cj, x) \ uj\=J:}. 

Corollary 6 Let T, be a formula and V C PS. ForgetVar{T,, V) is the logically strongest 
consequence ofT, that is Var-independent from V (up to logical equivalence). 

Corollary 7 If a formula ip is Var-independent from V , then \= ip if and only if 
ForgetVar{J:,V) \= ip. 

A consequence of the latter result is that forgetting variables is useful when only a subset 
of variables are really used in the queries. Thus, if S represents some pieces of knowledge 
about a scenario of interest, and we are interested in knowing whether a fact ip is true in the 
scenario, the logical operation to do is to query whether \= if. Now, if the possible facts (p 
we are interested in do not involve some variables V, then these variables can be forgotten 
from E, as querying whether (p is implied can be done on ForgetVarCE, V) instead of S. 

Through the previous proposition, some algorithms for forgetting variables can be easily 
derived from algorithms for forgetting literals (some of them have been sketched before). 
Specifically, polynomial time algorithms for forgetting variables within a DNF formula or a 
formula given by the set of its prime implicates can be obtained. Other tractable classes of 
propositional formulas for variable forgetting exist. For instance, taking advantage of the 
fact that ForgetVar{Y.A^, V) = ForgetVar{Tj, V) AForgetVar{^, V) whenever Far(S) n 
Far($) = holds, Darwiche (1999) showed that variable forgetting in a formula S can 
be done in linear time as soon as S is in Decomposable Negation Normal Form (DNNF), 
i.e., a (DAG)NNF formula in which the conjuncts of any conjimctive subformula do not 
share any variable. Interestingly, the DNNF fragment of propositional logic is strictly more 
succinct than the DNF one (especially, some DNNF formulas only admit exponentially-large 
equivalent DNF formulas) (Darwiche & Marquis. 1999). 

In the general case, just as for the literal situation, there is no way to forget efficiently 
(i.e., in polynomial time) a set of variables within a formula (unless P = NP). Nevertheless, 
the following decomposition property can be helpful in some situations (actually, it is heavily 
exploited in Kohlas et al., 1999). 

Proposition 21 Let S, $ be two formulas from PROPps, and V be a subset of PS. If 
V 0-1 S, then ForgetVar{T. A F) = S A ForgetVar{^, V). 

Note that the corresponding property for literal forgetting does not hold (as a previous 
example shows). 

Forgetting literals or variables proves helpful in various settings (we already sketched 
some of them in the introduction). For instance, minimal model inference (or circum- 
scription McCarthy, 1986) can be expressed using literal forgetting (but not directly using 
variable forgetting, which shows the interest of the more general form we introduced). In- 
deed, it is well-known that closed world inference from a knowledge base S can be logically 
characterized as classical entailment from S completed with some assumptions. In the 
circumscription framework (McCarthy, 1986), given a partition {P,Q,Z) of P5, such as- 
sumptions are the negations of the formulas a s.t. a does not contain any variable from 
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Z and for every clause 7 containing only positive literals built up from P and literals built 
up from Q, if E ^ 7 holds, then E ^ 7 V a holds as well. Equivalently, -la is considered a 
reasonable assumption whenever it is (syntactically) Var-independent from Z and expand- 
ing E with it does not modify what is already known about Lp U Lq. Clearly enough, the 
signs of literals from Lp really matter here. Using our previous notations, -la is assumed 
if and only if Z ly^l -la and E =^+y^-^^ E A -icn. As a consequence, we derive the following 
characterization of circumscription: 

Proposition 22 Let E, $ he two formulas from PROPps, and P, Q, Z be three disjoint 
sets of variables from PS (such that Far(E) U Var{^) C PUQU Z). It holds: 

• If ^ does not contain any variable from Z 

CIRC{^,{P,Q,Z)) ^ $ 
if and only if 
E \= ForgetLit{E A Lp U Lz) 

• In the general case: 

circ{j:,{p,q,z)) ^ $ 

if and only if 

E 1= ForgetLit{J: A -^ForgetLit{'L A -Lz U Lp),Lz U Lp) 

where CIRC is circumscription as defined in (McCarthy, 1986). 

Similar characterizations can be derived for the other forms of closed world reasoning 
pointed out so far. 

Forgetting also is a central concept when we are concerned with query answering w.r.t. 
a restricted target language. Indeed, in many problems, there is a set of variables for which 
we are not interested in their truth value (so we can forget them). For instance, in the 
SATPLAN framework by Kautz, McAllester, and Selman (1996), compiling away fluents or 
actions amounts to forgetting variables. Since the only variables we are really interested in 
within a given set of clauses representing a planning problem instance are those representing 
the plans, we can compile away any other variable, if this does not introduce an increase 
of size of the resulting formula. Another situation where such a forgetting naturally occurs 
is model-based diagnosis (Reiter, 1987): compiling away every variable except the abnor- 
mality ones does not remove any piece of information required to compute the conflicts 
and the diagnoses of a system. Thus, Darwiche (1998) shows how both the set of conflicts 
and the set of consistency-based diagnoses of a system is characterized by the formula ob- 
tained by forgetting every variable except the abnormality ones in the conjunction of the 
system description and the available observations. Provided that the system description 
has flrst been turned into DNNF, forgetting can be achieved in linear time and diagnoses 
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containing a minimal number of faulty components can be enumerated in (output) poly- 
nomial time. Interestingly, this work shows that the diagnosis task does not require the 
(usually expensive) computation of prime implicates/implicants to be achieved (actually, 
computing prime implicates/implicants is just a way to achieve variable forgetting and not 
a goal in consistency-based diagnosis) . Forgetting every variable from a formula allows for 
consistency checking since S is consistent if and only ii ForgetVar{Ti, Var{T,)) is consistent. 
The well-known Davis and Putnam algorithm for satisfiability testing (Davis &; Putnam, 
1960) (recently revisited by Dechter and Rish (1994)under the name directional resolution) 
basically consists in computing a clausal representation of ForgetVar{T,,Var{Yl)) from a 
CNF E using resolution; if the empty clause is not generated, then S is consistent and the 
converse also holds. 

Forgetting can also be used as a key concept in order to organize knowledge so as 
to replace one global inference into a number of local inferences as shown (among oth- 
ers) by Kohlas et al. (1999) and Amir and Mcllraith (2000), Mcllraith and Amir (2001). 
Loosely speaking, such approaches rely on the idea that exploiting all the pieces of infor- 
mation given in a knowledge base is typically not required for query answering. Focusing 
on what is relevant to the query is sufficient. While such techniques do not lower the 
complexity of inference from the theoretical side, they can lead to significant practical 
improvements. For instance, assume that S consists of three formulas $2, and $3. 
For any query let = (ULi \ Var{-^). We have E |= * if and only if 
ForgetVar{/\^^^ F*) ^ If Far($3) n (IJ^Li Var{^i)) = 0, this amounts to test inde- 
pendently whether ForgetVar{f\^_^^i,V-qf) |= ^ holds or ForgetVar{^s,V,s,) |= holds. 
This way, one global inference is replaced by two local inferences. Now, ForgetVar{^i A 
^2,V^) is equivalent to ForgetVar{^i A ForgetVar{^2:V^ fl (Far($2) \ ^a'"(^i))), ^*)- 
Accordingly, every variable from $2 that is not a variable of $1 or a variable of * can be 
forgotten first within $2 because it gives no information relevant to the query; thus, only a 
few pieces of knowledge have to be "propagated" from $2 to $1 before answering the query, 
and forgetting allows for characterizing them exactly. 

As evoked in the introduction, another scenario in which forgetting is useful is that of 
belief update. Indeed, there are many formalizations of belief update that are based on a 
form of variable forgetting. The basic scenario is the following one: we have a formula S 
that represents our knowledge; there are some changes in the world, and what we know is 
that after them a formula ip becomes true. The simplest way to deal with the update is 
to assume that (p represents all we know about the truth value of the variables in ip. As a 
result, we have to "forget" from S the value of the variables in Var{ip). There are different 
formalizations of this schema, based on whether formula is considered to carry information 
about variables it mentions (Winslett, 1990) or only on the variables it depends on (Hegner, 
1987), or also on variables related to dependent variables via a dependence function (Herzig, 
1996). This kind of update schema, while less known than the Possible Models Approach by 
Winslett (1990), has proved to be suited for reasoning about actions (Doherty et al., 1998; 
Herzig Sz Rifi, 1999). Furthermore, the possibility to forget literals (and not variables) is 
also valuable in this framework to take account for persistent information, as shown recently 
by some of us (Herzig et al., 2001), since the polarity of information is often significant. 
For instance, while forgetting the fluent alive from a knowledge base is not problematic, 
forgetting the persistent fluent -^alive would surely be inadequate. 



413 



Lang, Liberatore, & Marquis 



Forgetting can also be used to characterize a dependence relation called definability 
(Lang & Marquis, 1998b) as well as the strongest necessary (resp. weakest sufficient) 
conditions of a propositional variable on a set V of variables given a theory S (Lin, 2000; 
Doherty, Lukaszewicz, Sz Szalas, 2001). As shown in (Lang Sz Marquis, 1998b; Lin, 2000; 
Doherty et al., 2001), all these notions have many applications in various AI fields, including 
hypothesis discrimination, agent communication, theory approximation and abduction. 

Finally, based on literal and variable forgetting, valuable equivalence relations over for- 
mulas can also be defined: 

Definition 9 (Lit-equivalence, Var-equivalence) Let J], $ be two formulas from PRO Pps, 
L be a subset of Lps, and V be a subset of PS. 

• E and $ are said to be Lit-equivalent given L, denoted E =l ^, if and only if 
ForgetL'it{^, Lit{T?) \L) = ForgetLit{^, Lit{^) \ L). 

• S and $ are said to be Var-equivalent given V , denoted S =y if and only if 
ForgetVar{E, Far(S) \V) = ForgetVar{^, Far($) \ V). 

Example 14 Let T. = {a ^ b) A {b ^ c) and ^ = {a ^ d) A {d ^ c). Let L = {^a, c}. S 
and $ are Lit-equivalent given L. 

Such equivalence relations capture some forms of dependence between formulas. They 
are useful in the various situations where it is required to formally characterize the fact that 
two knowledge bases share some theorems. Thus, two formulas are Lit-equivalent given L 
whenever every clause containing literals from L only is a logical consequence of the first 
formula if and only if it is a logical consequence of the second formula. In the same vein, 
two Var-equivalent formulas given V have the same clausal consequences built up from V. 
Clearly, Lit-equivalence is more fine-grained than Var-equivalence in the sense that two 
formulas Var-equivalent given V are also Lit-equivalent given L = Ly, the set of literals 
built upon F, but the converse does not hold in the general case. Some applications are 
related to knowledge approximation ($ is a correct approximation of S over L if and only 
if $ and S are Lit-equivalent given L), normalization (turning a formula S into a CNF $ 
by introducing new symbols is acceptable as long as the two formulas are equivalent over 
the original language, i.e., S and $ are Var-equivalent given V = Far(S)), and so on. 

4.2 Complexity Results 

It is quite easy to prove that forgetting is a computationally expensive operation in the 
general case. Indeed, since a formula S is consistent if and only if ForgetLit{T,, Lit{I])) 
is consistent and since the latter formula is Lit-independent from every literal (i.e., it is 
equivalent to true or equivalent to false), there is no way to compute a formula $ equivalent 
to ForgetLit{T,. L) in polynomial time, unless P = NP. Actually, we can derive the more 
constraining result, showing that the size of any formula equivalent to ForgetLitCE, L) may 
be superpolynomially larger than the size of S. 

Proposition 23 Let T, be a formula from PROPps and let L be a finite subset of Lps- 
In the general case, there is no propositional formula $ equivalent to ForgetLit{Yl, L) s.t. 
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the size of $ is polynomially hounded in |E| + \L\, unless NP fl coNP C P/poly (which is 
considered unlikely in complexity theory). 

This result shows that computing an expUcit representation of ForgetLit{T,, L) under 
the form of a propositional formula is hard, even in a compilation-based approach where 
the time needed to derive such a formula is neglected. 

Finally, we have also derived: 

Proposition 24 (complexity of Lit/ Var-equivalence) 

LiT-EQUiVALENCE and Var-equivalence are II2- complete. 

5. Influenceability, Relevance, and Strict Relevance 

In this section, we show that several notions of dependence introduced in the literature are 
equivalent to, or can be expressed in terms of, semantical independence. In particular, we 
show that Boutilier's definition of influenceability (Boutilier, 1994) is in fact equivalent to 
(semantical) FV-dependence. The definition of relevance as given by Lakemeyer (1997) can 
also be proved to be equivalent to FV-dependence. One of the two definitions given by Lake- 
meyer (1997) for strict relevance can also be expressed in terms of FV-dependence. These 
results allow for finding the complexity of all these forms of dependence as a direct corollary 
to the complexity results reported in the previous section. For the sake of completeness, 
we also give the complexity of the original definition of strict relevance (Lakemeyer, 1995) 
(which is not directly related to FV-dependence), which turns out to be computationally 
simpler than the subsequent definition given by Lakemeyer (1997). 

5.1 Influenceability 

Boutilier (1994) introduces a notion of influenceability. Roughly speaking, a formula S is 
influenceable from a set of variables V if there exists a scenario in which the truth value of 
S depends on the value of the variables in V. This idea can be formalized as follows. 

Definition 10 (influenceability) Let be a formula from PROPps and V a subset of 
PS. S is influenceable from V if and only if there exists a PS \ V -world u), and two 
V -worlds uji and u)2 s.t. ui Awi |= S and a; A ^2 |= -^T, hold. 

In other words, there is a scenario lj in which the formula S can be true or false, 
depending on the value of the variables in V. While influenceability looks different from 
the definitions given in this paper, it can be shown that in fact influenceability coincides 
with FV-dependence. 

Proposition 25 Let H be a formula from PROPpg and V a subset of PS. S is influ- 
enceable from V if and only ifH is Var-dependent on V. 

As a consequence, a model-theoretic characterization of influenceability can be easily 
derived from the one for FV-independence. The complexity of influenceability is an easy 
corollary to this property: INFLUENCEABILITY is NP-complete. 
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5.2 Relevance 

Lakemeyer (1995, 1997) introduces several forms of relevance. We show how these forms of 
relevance are strongly related to FV-independence. We also complete the results given in 
(Lakemeyer, 1997), by exhibiting the computational complexity of each form of relevance 
introduced in (Lakemeyer, 1997). 

5.2.1 Relevance of a formula to a subject matter 

Lakemeyer's notion of relevance of a formula to a subject matter can be defined in terms of 
prime implicates of the formula, as follows (see Definition 9 in Lakemeyer, 1997): 

Definition 11 (relevance to a subject matter) LefE be a formula from PRO Pps and 
V a subset of PS. S is relevant to V if and only if there exists a prime implicate of S 
mentioning a variable from V. 

Example 15 Let S = (a A c) and V = {a, b}. S is relevant to V. 

As a consequence, Lakemeyer's notion of irrelevance of a formula to a subject matter 
coincides with FV-independence. 

Proposition 26 Let T, be a formula from PROPpg and V a subset of PS. S is relevant 
to V if and only if S is Var-dependent on V. 

Thus, the model-theoretic characterization of FV-independence also applies to irrele- 
vance of a formula to a subject matter. We also have that the irrelevance of a formula to 
a subject matter coincides with Boutilier's definition of influenceability. Finally, the above 
proposition allows for an easy proof of complexity for relevance, namely, RELEVANCE OF A 
FORMULA TO A SUBJECT MATTER is NP-COmplete. 

5.2.2 Strict relevance of a formula to a subject matter 

Lakemeyer has introduced two forms of strict relevance. The (chronologically) first one has 
been given in (Lakemeyer, 1995), as follows. 

Definition 12 (strict relevance to a subject matter, Lakemeyer, 1995) Let T, be a 

formula from PROPps and V a subset of PS. S is strictly relevant to V if and only if 
every prime implicate of H contains a variable from V. 

Lakemeyer has also introduced another notion of strict relevance (Lakemeyer, 1997), 
more demanding than the original one. Here we consider an equivalent definition. 

Definition 13 (strict relevance to a subject matter, Lakemeyer, 1997) Let be a 

formula from PROPps and V a subset of PS. S is strictly relevant to V if and only if 
there exists a prime implicate of S mentioning a variable from V, and every prime implicate 
of S mentions only variables from V. 
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Both definitions prevent tautologies and contradictory formulas from being strictly rele- 
vant to any set of variables. The basic difference between these two definitions is that in the 
first one we want that every prime implicate of E contains at least a variable from V, while 
in the second case we impose that every prime implicate of S must contain only variables 
from V^. As the following example shows, there are formulas for which the two definitions 
of strict relevance do not coincide. 

Example 16 Let S = (a V 6) and V = {a}. There is only one prime implicate of S, 
namely ay h. Since it contains at least a variable of V , it follows that E is strictly relevant 
to V w.r.t. (Lakemeyer, 1995). However, since the prime implicate a\/ h is not com.posed 
only of variables of V (because b ^ V), it follows that S is not strictly relevant to V w.r.t. 
(Lakemeyer, 1997). 

Through FV-independence, we can derive an alternative characterization of the notion of 
strict relevance introduced by Lakemeyer (1997). Indeed, as a straightforward consequence 
of the definition, we have: 

Proposition 27 Let T, be a formula from PROPps and V a subset of PS. S is strictly 
relevant to V if and only ifH is Var-dependent on V and Var-independent from Var(T,)\V . 

We have identified the complexity of both definitions of strict relevance, and they turn 
out to be different: the first definition is harder than the second one. 

Proposition 28 (complexity of strict relevance) 

(1) STRICT RELEVANCE OF A FORMULA TO A SUBJECT MATTER (Lakemeyer, 1995) is 
Il2-complete. 

(2) STRICT RELEVANCE OF A FORMULA TO A SUBJECT MATTER (Lakemeyer, 1997) is 
^\^2- complete. 

These complexity results improve Theorem 50 from (Lakemeyer, 1997), which only 
points out the NP-hardness of irrelevance and strict irrelevance as defined in (Lakemeyer, 
1997). 

6. Other Related Work and Further Extensions 

In this section, we first discuss other related work, then some possible extensions of the 
notions and results we have presented before. 

6.1 Other Related Work 

As already evoked, independence has been considered under various forms in various AI 
fields. 

5. Strict relevance as in (Lakemeyer, 1997) could also be shown to be strongly related to controllability 
(Boutilier, 1994; Lang & Marquis, 1998b). 
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6.1.1 Independence in propositional logic 

There are other forms of independence in propositional logic that we have not considered 
in this article, especially, definability, controllability (Lang & Marquis, 1998b) as well as 
conditional independence (Darwiche, 1997). If X, Y and Z are three disjoint sets of variables 
and S is a knowledge base then X and Y are conditionally independent w.r.t. Z knowing 
E if and only if for any Z- world (jJz, once we know (jJz (and S), learning something about 
X cannot make us learn anything new about Y (and vice versa). The computational 
issues pertaining to conditional independence and to stronger notions as well as related 
notions such as relevance between subject matters (Lakemeyer, 1997) and novelty (Greiner 
Sz Genesereth, 1983), have been extensively studied in a companion paper (Lang, Liberatore, 
& Marquis, 2002). 

6.1.2 Implicit and explicit dependence 

Several approaches to belief change make use of a explicit dependence relation, which means 
that it is part of the input (while ours is implicit, i.e., derived from the input). Thus, com- 
puting independence relations from a knowledge base can be seen as an upstream task 
enabling us to specify the "core" (minimal) independence relation upon which the belief 
change operator is based; this core independence relation can then be completed by spec- 
ifying explicitly some additional dependencies using knowledge about the domain. Such 
an approach has been proposed for belief revision in (Farifias del Cerro Sz Herzig, 1996), 
for belief update in (Marquis, 1994) and (Herzig, 1996) and for reasoning about action in 
(Herzig & Rifi, 1999). 

6.1.3 Independence and contexts 

Contextual reasoning (Ghidini Sz Giunchiglia, 2001) has been introduced for formalizing 
domains in which knowledge can naturally be divided into parts (contexts). Each context is 
characterized by its own language and alphabet. The knowledge base of a context contains 
what is relevant to a part of the domain. However, it is not guaranteed that the different 
parts of the domain do not interact, so inference in one context may be affected by the 
knowledge of some other context. 

The main difference between contextual reasoning and independence is that the lat- 
ter is a study of the relevance relation that can be drawn from a "flat" (i.e., not divided 
into contexts) knowledge base; whereas contextual reasoning is on knowledge about specific 
contexts, that is, knowledge is expressed by specifying which context it refers to. In other 
words, the relevance relation is a result of reasoning about knowledge in studying depen- 
dency; on the other hand, it is one of the data that has to be provided for reasoning about 
contexts. 

6.1.4 The definition of irrelevance by Levy et al. 

The definition of irrelevance given by Levy, Fikes, and Sagiv (1997) aims at establishing 
which facts of a knowledge base are irrelevant to the derivation of a query. In particular, 
they consider a first-order logic with no function symbols and a set of inference rules. A 
knowledge base is a set of closed formulas (formulas with no free variables). Derivation of a 
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query (another closed formula) is obtained by applying the inference rules to the knowledge 
base and the logical axioms of the theory. 

A formula (f) of the knowledge base is irrelevant to the derivation of another formula 
■il) if 4> does not "participate" to the process of inferring ij) from the knowledge base. For 
example, in the knowledge base {^(1), C{2),A{X) =^ it is clear that ^(1) is relevant 

to B{1), while C(2) is not. 

This definition becomes complicated when more complex scenarios are considered. Namely, 
Levy et al. consider three different "coordinates": first, whether all derivations are consid- 
ered or just one; second, whether we consider all derivations or just minimal ones; third, 
whether we consider membership to the proof or just derivability from the formulas that 
compose the proof (in this case, we have four possible choices). 

Besides the fact that this notion of irrelevance is based on first-order logic, there are 
other, more substantial, differences between it and the ideas investigated in this paper. 
First, it is a relation between two formulas, given a background knowledge base. As such, 
it is more related to other notions of relevance in the literature (Lang h Marquis, 1998a). 
Second, it is based on the concept of proof, which is something not completely dependent on 
the semantics. For example, replacing the usual modus ponens with the strange inference 
rule a, /3, a =^ 7 — > 7, which does not change the semantics of the logics, then the formula 
C(2) of the knowledge base above becomes magically relevant to B{1). This is perfectly 
reasonable in the approach by Levy et al., where improving efficiency using a specific proof 
theory is the aim. 

6.2 Extending our Notions and Results 

The notions and results presented in this paper can be extended in several directions. 

6.2.1 Numerical and ordinal notions of formula- variable independence; 
forgetting in more general structures 

It may be worth wondering about how the notions of FL-independence and FV-independence 
can be generalized to the case where the knowledge base is not a mere propositional for- 
mula but a probability distribution over fips, or an incomplete probability (or equivalently 
a set of probability distributions), or a stratified knowledge base, or any other ordinal or 
numerical structure. 

First, let us notice that some of our characterizations lead to quite intuitive and general 
ideas. To take the case of FV-independence, we have shown that the following three state- 
ments are equivalent when S is a propositional knowledge base: 

(a) S does not tell anything about x, in any context; 

(b) S can be rewritten equivalently in a formula S' in which x does not appear 

(Definition 4); 

(c) for any two interpretations w and uj' that differ only in the value given to a;, the status 
of w with respect to S (i.e., model or countermodel) is the same as that of (Corollary 1). 

As to variable forgetting: 

(d) ForgetVar{Tj, V) is the most general consequence of S that is Var-independent from V 
(Corollary 16). 
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Now, these definitions have a sufficient level of generality to be extended to the case where 
the knowledge base S is replaced by another structure. 

Thus, some of us have extended the notion of Var-independence (characterized using 
(b)) and variable forgetting to ordinal conditional functions (represented as stratified bases) 
(Lang, Marquis, & Williams, 2001). The basic purpose was to extend the "forget and 
expand" approach at work for updating "flat" knowledge bases (as briefly discussed in 
Section 4) to updating more sophisticated epistemic states. 

The case of incomplete probabilities is also interesting since generalizing (a), (b) and (c) 
will lead to the same intuitive notion ^. As to variable forgetting, it is not hard to notice 
that it corresponds to the well-known notion of marginalization. 

6.2.2 Formula- VARIABLE independence in nonclassical (propositional) logics 

The deflnition of FL- and FV-independence (S can be rewritten equivalently in a formula 
E' in which / (resp. x) does not appear) is quite general, in the sense that the logical 
language and/or the consequence relation (and thus the notion of logical equivalence) may 
vary, which enables us to draw from this principle notions of FL- and FV-independence 
(and from then on, notions of literal and variable forgetting) for nonclassical logics. This 
is what has been done (at least partly) in (Lakemeyer, 1997). Here we briefly consider two 
cases: 

(i) subclassical logics. These logics are built on a classical language but have a weaker con- 
sequence relation than classical logic. For instance, in most multivalued logics, due to the 
fact that the excluded middle (a V -la) is not a theorem, a formula such as a V (-16 A b) 
will depend both on a and on b (whereas it depends only on a in classical logic), because it 
cannot be rewritten equivalently into a formula in which b does not appear (in particular, a 
is not always equivalent to aV (-16A6) in such logics); on the contrary, the formula aV (a A6) 
is equivalent to a in usual multivalued logics and thus depends on a only. 

(ii) modal logics. Now, the language is obtained by extending a classical propositional lan- 
guage with one or more modalities, while the consequence relation extends that of classical 
logic (in the sense that a modality-free formula is a theorem if and only if it is a theorem 
of classical logic). Therefore, results such as Propositions 6 and 7 still make sense and we 
conjecture that they are still valid. To take an example, in the logic 55 where DS =^ ()T, is 
a theorem, a formula such as DS A 0(S V $) is independent from $ while OS A □(£ V #) 
is not. 

6.2.3 Restricted queries and characteristic models 

Forgetting a set of variables modifles a knowledge base while preserving some of the con- 
sequences, namely, those built on variables that are not forgotten. Other frameworks are 

6. This is not the case for single probability distributions: when the knowledge base is represented by a prob- 
ability distribution pr, (a) does not lead to an acceptable definition (because a full probability distribution 
on flps always tell something about x); (c) suggests the definition Wu! € flps,pr{Switch{ui,x)) =pr{ui), 
which leads to the decomposability of pr w.r.t. x: pr is a joint probability distribution obtained from 
a probability distribution prps\{x} on f^ps\{T} and the probability distribution pr^ on {x} defined by 
prx{x) = |(= prx{-ix)). (b) would lead to the same definition, noticing that a full probability dis- 
tribution can be expressed more compactly by a "partial" probability distribution (here, a probability 
distribution on nps\{2!}) from which pr is induced by the maximum entropy principle. 
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based on restricting the possible queries in other ways, i.e., considering only queries in Horn 
form (Kautz, Kearns, & Selman, 1995). 

Reasoning with characteristic models (Kautz, Kearns, h Selman, 1993) is based on using 
only a subset of models of the original formula. As for forgetting, this may increase the 
size of the representation of a knowledge base exponentially. Khardon and Roth (1996) 
have shown that characteristic models can be used for efficient reasoning on some sets of 
restricted queries. 



7. Concluding Remarks 

We have investigated several ways of answering the key question of determining what a 
propositional knowledge base tells about the (in) dependencies between variables and for- 
mulas. For each of the notions studied, we have related it to other previously known notions 
and we have studied it from a computational point of view, giving both complexity results 
and characterization results to be used for practical computation. In the light of our results, 
it appears that the various forms of logical independence are closely connected. Especially, 
several of them had been proposed by different authors without being explicitely related. 
Boutilier's influenceability and Lakemeyer's relevance of a formula to a subject matter are 
identical to FV-independence (Propositions 25 and 26). We also discussed much related 
work, and suggest some extensions to more general framework than mere propositional 
logic. 

The following table gives a synthetic view of many notions addressed in this paper and 
the corresponding complexity results. 



Problem 


Notation 


Definition 


Complexity 


Synt. Lit-independence 




L«i(E) n L = 


P 


Lit-independence 




3$ . $ = S, Lit{<^) n L = 


coNP-complete 


Dependent literals 


DepLit{T,) 


{I 1 {1} ^ E} 


BH2-complete 


FullLit-independence 




L C DepLit{T,) 


coNP-complete 


Lit-simplified 




V/ . {/} S ^ / ^ L'it(S) 


N P-complete 


Lit-equivalence 




ForgetLit{T.,Lit{Y,) \L) = 








ForgetLit{^,Lit{^) \ L) 


n2-complete 


Problems on literals and their complexity. 




Problem 


Notation 


Definition 


Complexity 


Synt. Var-independence 




Far(S) n F = 


P 


Var-independence 


F 01 S 


3$ . $ = S, yar($) nS = 


CO NP-complete 


Dependent variables 


DepVar{T,) 


{v\v^t S} 


BH2-complete 


FullVar-independence 




V C DepVar{T,) 


CO NP-complete 


Var-simplified 




Mv . {v} y/^t ^ V ^ Far(E) NP-complete 


Var-equi valence 




ForgetVar{T., Far(S) \V) = 








ForgetVarl^, Var{^) \ V) 


n2-complete 



Problems on variables and their complexity. 



421 



Lang, Liberatore, & Marquis 



The fact that both notions of formula-variable independence and forgetting have been 
used as key concepts in many AI fields (including automated reasoning, belief revision and 
update, diagnosis, reasoning about actions etc.) has been discussed before (Sections 1 and 
4), so we will refrain from repeating it here. The gain of generality offered by the corre- 
sponding literal-oriented notions introduced in this paper has also been established (e.g., 
Proposition 20), and their application to several AI problems (like closed- world reasoning 
and belief update) has been sketched. 

Primarily, one of the main motivations for the notions of formula- variable independence 
and forgetting was to improve inference from a computational side, by enabling to focus on 
relevant pieces of knowledge. The extent to which this goal can be reached can be discussed 
at the light of our complexity results: 

• Most (in) dependence relations have a high complexity. The notions connected to FV- 
dependence (FL-dependence, full FV- and FL-dependence, influenceability, relevance 
to a subject matter and strict relevance second form) have a complexity at the first 
level of the polynomial hierarchy, which means that they can be checked by a satis- 
fiability or/and an unsatisfiability solver. They become "tractable" when syntactical 
restrictions are made (Proposition 11). Forgetting (literals or variables) also is com- 
putationally expensive. The remaining notions are in complexity classes located at 
the second level of the polynomial hierarchy. Worse, under the standard assumptions 
of complexity theory, the explicit computation of literal or variable forgetting cannot 
be achieved in polynomial space in the general case (Proposition 23). This pushes 
towards the negative conclusion that all these notions are hard to be computed (at 
least in the worst case) except if the size or the syntactical form of the input enables it. 
The fact that these problems fall in the second level of the polynomial hierarchy are 
not that surprising since this is where a large part (if not the majority) of important 
problems in knowledge representation^ fall. 

• But a high worst-case complexity does not necessarily prevent from practical algo- 
rithms! Thus, Amir and Mcllraith have shown the computational benefits that can 
be achieved by structuring a KB (through the forgetting operation) so as to achieve 
inference and consequence finding more efficiently (Amir & Mcllraith, 2000; Mcllraith 
Sz Amir, 2001). A similar discrepancy between the worst case situation and the practi- 
cal ones can be observed in other domains; especially, satisfiability-based checkers for 
quantified boolean formulas (Biere, Cimatti, Clarke, Fujita, Sz Zhu, 1999; Williams, 
Biere, Clarke, & Gupta, 2000) used for formal verification purposes (bounded model 
checking) exhibit interesting computational behaviours (actually, they typically per- 
form better than specialized algorithms, as shown in Rintanen, 2001), despite the 
fact that they are confronted to the problem of variable forgetting (i.e., elimination 
of existentially quantified variables). 

• Moreover, preprocessing may play an important role. What we mean with "prepro- 
cessing" refers to the task of computing (in) dependence relations and forgetting before 

7. Such as abduction, nonmonotonic inference, belief revision, belief update, some forms of planning and 
decision making. 
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performing more problem-oriented tasks such as consequence finding, diagnosis, ac- 
tion/update, decision making etc. Thus, Lit-simplifying (or Var(simpHfying) a KB 
during a preliminary off-line phase can prove helpful for improving on-line inference 
since simplification never increases the size of a KB. As shown by Proposition 23, a 
similar conclusion cannot be drawn to what concerns forgetting. This seems to be the 
price to be paid to benefit from the power of forgetting. However, this negative conclu- 
sion must be tempered by the two following comments. On the one hand, forgetting is 
interesting per se; it is not only a tool that can help improving inference in some cases 
but also a goal in several AI applications. On the other hand, our complexity results 
relate to the worst case situation, only, and, as evoked before, forgetting is feasible in 
many practical cases. Finally, let us note that there are several complete propositional 
fragments for which forgetting is easy. Especially, as advocated by Darwiche (1999), 
compiling a KB into a DNNF formula during an off-line step can prove practically 
valuable to achieve forgetting in an efficient way, provided that the size of the compiled 
form remains small enough (which cannot be guaranteed in the worst case). Since it 
is not known whether the DNNF fragment is strictly more succinct than the prime 
implicates one (Darwiche Sz Marquis, 1999), the prime implicates fragment can also be 
targeted with profit as a compilation language for some knowledge bases; especially, 
some recent approaches to the implicit representation of prime implicates (Simon &; 
del Val, 2001) exhibit very significant empirical performances (they enable the com- 
putation of sets of prime implicates containing up to lO^'' clauses). Accordingly, they 
can prove valuable for the practical computing of independence and forgetting. 
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Appendix A: Proofs 

Proposition 1 A formula E is Lit- independent from I if and only if, for any interpretation 
u) € ^ps, «/w 1= S then Force{u),^l) |= S. 

Proof: Assume that S is Lit-independent from Then, there exists a formula $ in NNF 
that is equivalent to S, and does not contain Then, for any u G ^ps such that a; |= $ 
we have Force{u), -■/) |= Since $ is equivalent to S, we conclude that the same property 
holds for E. 

Assume that, for any interpretation cj G S^ps, a; |= S implies Force{LJ,^l) \= S. We 
prove that S is Lit-independent from Indeed, let be the term whose only model is lj. 
The following equivalence holds: 

S = VITc. I H 

= \/{7a) 1^1=^ ^11*^1 (xi ^ /} U {'foj I w 1= S and w |= 1} 

= yiluj I <^ H ^ and a; ^ /} U {7^ V 7Force(a;,-0 I w 1= S and w |= /} 

The latter step can be done because w \= T, implies that Force{u),^l) is also a model 
of S. Now, if w ^ / then 7^^ does not contain I. On the other hand, if w |= then 
7a; V 7Force(w,-.0 ^an be rewritten as a conjunction of literals not containing neither / nor 
its negation. As a result, the above formula (which is in NNF) does not contain /, which 
means that S is Lit-independent from o 

Proposition 2 

(1) DepLiti^) C Lit{T,); 

(2) IfT. = ^, then DepLit{T) = DepLit{<^): 
(3a) DepLit{T. A $) C DepLit{T.) U DepLit{^); 
(3b) DepLit{T, V C DepLit{Y!) U DepLit{^): 
(4) I G DepLit{T.) if and only if ^l € DepLit{^T,). 

Proof: 

1. Trivial. 

2. L i->^ E if and only if there exists a formula * that is equivalent to S, and such that 
* is syntactically Lit-independent from L. Since S = it follows that $ = 

3. Let ^ (resp. H) be a NNF formula equivalent to S (resp. $) s.t. no literal of L occurs 
in it. Then * A 11 (resp. ^ V H) is a formula equivalent to S A $ (resp. S V which 
is in NNF, and no literal of L occurs in it. 

4. Straightforward from the fact that / appears in a NNF formula S if and only if 
appears in the NNF form of -lE. 
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Proposition 3 

(1) DepVar{T.) C Far(S); 

(2) If^ = <^>, then De,pVar{i:) = DepVar{^); 
(3a) DepVar{Y. A C DepLit{Y.) U DepLit{^); 
(3b) DepVar{T. V C DepLit{T!) U DepLit{^); 
(4) DepVar{-n'E) = DepVar{T.). 

Proof: (1) is trivial; (2) and (3) are similar to the proof of points (2) and (3) of Propo- 
sition 2, replacing "literal" by "variable", / by v, and DepLit by DepVar. As to (4): if 
X ^ DepVar (Y,) then there exists a formula $ equivalent to S in which x does not appear; 
since x does not appear in either, is a formula equivalent to -iS in which x does not 
appear. o 

Proposition 4 Let H be a formula from PROPps- 

• is Lit-simplified if and only if the following equivalence holds: for every L C Lps, 
S is syntactically Lit-independent from L if and only if Lv/^Yu holds. 

• Y, is Var-, 'simplified if and only if the following equivalence holds: for every V C PS, 
S is syntactically Var-independent from V if and only if V S holds. 

Proof: 

• Lit-simplification 

— Assume E is Lit-simplified (thus Lit{Y) = DepLit{Y)) and let L C Lps- 
If S is syntactically Lit-independent from L, then L n L«i(S) = 0, thus L n 
DepLit{Y) = 0, i.e., L E. 

— <^=: Assume that E is not Lit-simplified. Then there exists I G Lit{Y) s.t. E 
is Lit-independent from I. With L = {I}, it is clear that E is syntactically 
Lit-dependent on L, while Lit-independent from it, contradiction. 

• Var-simplification. The proof is similar to the Lit-simplification case, replacing "Lit- 
independent" by "Var-independent", L by V, Lpg by P5, / by a;, Lit{Y) by Far(E). 



Proposition 5 For every E from PROPps, there exists a Lit-simplified formula $ s.t. 
E = 

Proof: Since E is Lit-independent from L \ DepLit{Y) we know that there exists a 
NNF formula $ equivalent to E such that Lit{^) Ci {L \ DepLit{Y)) = 0, i.e., such that 
Lit{^) C DepLit{Y). By point (2) of Proposition 2 we have DepLit(^) = DepLit{Y). 
Thus, DepLit{^) C Lit{^)) C DepLit{Y) = DepLit{^), from which we conclude that 
DepLit{^) = Lit{^), i.e., $ is Lit-simplified. <> 

Proposition 6 Let Y be a formula from PROPpg and I be a literal of Lpg. The next 
four statements are equivalent: 
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(1) I S; 

(2) S^^i 1= s^^o; 

(3) s 1= s^^o; 

(4) 1= s. 

Proof: 

(1) ^ (2): Let v the variable of / and assume that ^ S;^o, which means that there is 

a PS\ {wl-world w such that w |= A -iS;^o; since S is equivalent to {I A V 
{^l A S;^0), we have oj A {/} |= S^^i A / ^ S and Force{uj A {/}, ^l) = lo A {^1} ^ 
A -iS/^o; hence, Force{uj A {/}, ^ S and therefore / i-^ S by Proposition 1. 

(2) ^ (3): Assume S;^i |= S;^o- We have the following chain of implications: 

E = (/AS,^i) VHaE;^o) 
S H (/ A S;^o) V H A E;^o) 

(3) =^ (1): Let us assume that S |= S^^-o, and prove that / i/> S. Indeed, the assumption 

can be rewritten as: 

Vcj G flps . 1= S ^ a; 1= S;<^o) 

Now, (jj \= S;^o is equivalent to say that changing the truth value of / to false, lj is 
still a model of S. In formulas, 

yu; e Qps . {co \= T, ^ Force{iJ, ^l) \= S) 

This is exactly the definition of / i/> E. 
(2) ^ (4): Same as the proof of (2) ^ (3). 

(4) ^ (1): Similar to the proof of (3) ^ (1). 

o 

Proposition 7 Let H be a formula from PROPps and x be a variable of PS. The next 
four statements are equivalent: 

(1) X S; 

(2) Sjj^o = ^x-i^li 

(3) S = Sa^^o; 

(4) S = Ss^i- 

Proof: Easy consequence of the definition of FV-independence (E is Var-independent from 
X if and only if it is Lit-independent from {a;, -■a;}), and Proposition 6). o 

Proposition 8 Let S he a formula from PROPps and L be a subset of Lpg- The next 
statements are equivalent: 
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(1) L E; 

(2) P/(S) C {7 

(3) /P(S) C {S 



J is a term that does not contain any literal from L}; 
S is a clause that does not contain any literal from L}. 



Proof: 



• (1) 



^(2): 



— If S is Lit-independent from L, then there exists a NNF formula $ equivalent 
to S s.t. Lit{^) n 1/ = 0. Clearly enough, the property Lit{^) n L = is still 
satisfied if $ is turned into DNF, especially when $ is turned into its prime 
implicant normal form. Since two equivalent formulas have the same prime 
implicants, no term of contains a literal from L. 

— is a NNF formula that is equivalent to E and syntactically Lit- 
independent from L. Hence, S is Lit-independent from L. 

• (1) <^ (3): Similar to the prime implicant situation, using CNF instead of DNF. 



Proposition 9 Let T, be a formula from PROPps and V he a subset of PS. The next 
statements are equivalent: 



(2) PI{^) C {7 I 7 is a term that does not contain any variable from F}; 

(3) IP{^) C {5 I 5 is a clause that does not contain any variable from F}. 

Proof: Easy consequence of the definition of FV-independence, plus Proposition 9 above, o 

Proposition 10 FL dependence, FV dependence, full FL dependence and full 
FV dependence are HP-complete. 



• FL dependence 

— Membership. In order to show that S is Lit-dependent on L, it is sufficient to 
guess a literal I from L and a (Far(E) \ {Far(/)})-world oj that is a model of 

but not a model of S^^q. These tests can be achieved in time polynomial 

in 

— Hardness. Let us consider the mapping M s.t. M(S) = (E A new, new), where 
new is a propositional variable that does not occur in S. Clearly enough, M(S) 
can be computed in time polynomial in Moreover, S is satisfiable if and only 
if E A new is Lit-dependent on {new}. 

• FV dependence 



o 



(1) V E; 



Proof: 
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— Membership. In order to show that S is Var- dependent on V, it is sufficient to 
guess a variable x from V and a (Far(S) \ {a:})-world lj that is not a model of 
the formula '^x-i^o '^x^i- This test can be achieved in time polynomial in |S|. 

— Hardness. Similar to the hardness proof of FL-dependence, replacing "Lit- 
dependent" by "Var-dependent" . 

• full FL dependence 

— Membership. E is fully Lit-dependent on L = {li, ...,/„} if and only if S^^^i ^ 
Sij<_0 and ... and Y= S/„-f-0 holds. Equivalently, E is fully Lit-dependent 
on L = {/i,. ..,/„} if and only if the formula renamei(S;j^i A -iS^j^o) A ... A 
rename^ (S;^<_1 A -iS;^^o) is satisfiable, where each renamei is a renaming that 
maps variables to new symbols (in a uniform way). Since this formula can be 
computed in time polynomial in |S| + the membership of FULL FL DEPEN- 
DENCE to NP follows. 

— Hardness. Full FL-dependence and FL-dependence coincide in the case in which 
L is composed of a single literal. Since the NP-hardness of FL-dependence has 
been proved using a set L composed of a single literal, the NP-hardness of full 
FL dependence follows. 

• full FV dependence 

— Membership. S is fully Var-dependent on V = {xi, if and only if E ^ 
Ylxi<^o and ... and S ^ S^j^^o holds. Equivalently, S is fully Var-dependent on 
V = {xi, ...,Xn} if and only if the formula renamei(S©S3;j^o) A...Arename„(S© 
Sx„-«-o) is satisfiable, where each renamci is a renaming that maps variables to 
new symbols (in a uniform way). Since this formula can be computed in time 
polynomial in |S| + |F|, the membership of FULL FV DEPENDENCE to NP follows. 

— Hardness. Full FV-dependence and FV-dependence coincide if L is composed of 
a single literal. Since the NP-hardness of FV-dependence has been proved using a 
set L composed of a single literal, the NP-hardness of full FV-dependence follows. 

o 

Proposition 11 Whenever E belongs to a class C of CNF formulas that is tractable for 
clausal query answering (i.e., there exists a polytime algorithm to determine whether S |= 7 
for any CNF formula 7^ and stable for variable instantiation (i.e., replacing in T, € C any 
variable by true or by false gives a formula that still belongs to C) then FL DEPENDENCE, 
FV DEPENDENCE, FULL FL DEPENDENCE and FULL FV DEPENDENCE are in P. 

Proof: This is a straightforward consequence of Propositions 6 and 7. When E belongs to 
a class C of formulas that is tractable for clausal query answering and stable for variable 
instantiation, we can easily check whether S |= S^j^o and Ti^^o \= S holds. o 
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Proposition 12 LiT-SIMPLIFIED FORMULA and VAR-SIMPLIFIED FORMULA are NP-complete. 

Proof: 

• Lit-simplification 

— Membership. Easy consequence of the fact that Lit- simplified formula is 
a restriction of FULL FL DEPENDENCE that is in NP (and NP is closed under 
polynomial reductions). 

— Hardness. We prove that a formula built over an alphabet X = {xi, . . . , Xn}, 
is satisfiable if and only if formula S is L-simplified, where 

E = ($V$[X/-X]) A /\ Xi^^Vi 

XiGX 

where is the formula obtained by replacing in $ each occurrence of Xi 

(resp. ^Xi) with ^x-i (resp. x-i). 

First, if $ is not satisfiable, neither ^[X/^X] is, thus V ^[X/^X]) is not 
satisfiable. As a result, S is not satisfiable, thus it is not Lit-simplified, because 
DepLit{Yl) = but S mentions variables xi and yi. 

Assume $ satisfiable. Clearly, S is satisfiable as well: let a; be a model of S. We 
prove that S is Lit-simplified by showing that it is Lit-dependent on each literal it 
contains. We have Lit{T) = a:;„, yi, ...,?/„, -.a;„, -.yi, -.?/„}. 

Let k G Lit{T). 

(1) Assume that u) \= li. Then, Force{uj, -^li) ^ S. Indeed, let Var{li) = xi 
(resp. Ui): lv satisfies Xi <^ yi so changing the truth value oi Xi only (resp. yi 
only) leads to a model that does not satisfy Xi ^ yi. 

(2) Otherwise, we have uj |= -i/j. Replacing li by -i/j in the proof of case (1) 
enables deriving the expected conclusion. 

— Var-simplification 

* Membership. Easy consequence of the fact that Var- simplified formula 
is a restriction of FULL FV DEPENDENCE that is in NP (and NP is closed 
under polynomial reduction). 

* Hardness. The proof is similar to the proof of NP-hardness of LIT-SIMPLIFIED 
FORMULA, replacing "Lit-simplified" with "Var-simplified" . 

o 

Proposition 13 

1. Determining whether DepLit{T,) = L (where L is a set of literals), and determining 
whether DepVarCS) = X (where X is a set of variables) is BH2-complete. 

2. The search problem consisting in computing DepLit{Yl) (respectively DepVar{Ti)) is 
in FA2 and is both NP and coNP-hard. 
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Proof: 

• Determining whether DepLit{T,) = L is BH2-complete. 

— Membership. 

DepLit{T,) = L if and only if (i) I i-)- S holds for every I G L and (ii) / 17^ S 
holds for every I G Lit{Tj) \ L; the set of instances (S,L) such that (i) holds is 
the union of a linear number of problems in NP, thus it is in NP; similarly, the 
set of instances (S, L) such that (ii) holds is a problem in coNP. This proves the 
membership to BH2 of the problem of determining whether DepLit{Yl) = L. 

— Hardness. 

Let r) be a pair of propositional formulas. Without loss of generality, we as- 
sume that $ and F do not share any variable and that X = Var{(k) = {xi, a;„}, 
Far(r) = {^1, ...,Zp}. Furthermore we assume that X / (if it were not the case 
it would then be sufficient then to replace $ by $ A (t V -it) before performing 
the reduction). We prove that F) is a positive instance of sat-unsat, i.e., $ 
is satisfiable and F is unsatisfiable, if and only if DepLit{<^ A -iF) = Lit(S($)) 
where S($) = ($ V $[X/-iX]) A Ax^ex Vi ^ previous proof (note that 
E($) and F do not share any variables). 

(i) if $ is satisfiable and F is unsatisfiable, then S(<I') is Lit-simplified ac- 
cording to the previous proof and thus DepL«t(S($)) = Lit(S(#)) whereas 
DepLit{-nT) = 0, which together entail that DepLit{'E{^) A ^F) = Lit{J:{^)). 

(ii) Assume that $ is unsatisfiable and DepLit{T,{^) A -iF) = X«t(S($)). Then 
E(<1>) is not simplified (recall that Var{^) is not empty and hence so is L«t(S($))), 
thus DepLit{Y:{^)) C L«t(E($)). Now, DepLit{T,{^) A^T) = Lit(E($)) implies 
DepLit{'E,{^) A -iF) n Lit{r) = 0, which is possible only if F is a tautology (be- 
cause S($) and F do not share any variables); but in this case, E($) A -iF is 
inconsistent and thus DepLit(E{^) A -iF) = 0, hence, L«t(E($)) = 0, which is 
contradictory. 

(iii) Assume that F is satisfiable and DepLit{Il{^) A -iF) = L«t(E($)). 

The second condition can hold only if S(<1>) is unsatisfiable, hence not Lit- 
simplified (since _Lit(S (<]>)) 7^ 0), so $ is unsatisfiable as well; this takes us 
back to the case (ii), which leads us to the same contradiction again. 

• Computing DepLit{T,) is in FA2 since FL dependence is in NP and DepLit{T) C 
Lit{Yl) (hence, it is sufficient to test for every / G Lit{Yl) whether or not E is Lit- 
independent from it). It is also both NP-hard and coNP-hard. NP-hardness is showed 
by the following polynomial reduction from SAT: A CNF formula F is satisfiable 
if and only if F is valid or DepLit{F) / 0; clearly enough, valid CNF formulas 
can be recognized in polynomial time. Thus, SAT can be solved if we know how to 
compute DepLit{Yl) for any S, which shows that computing DepL'it{Yi) is NP-hard. 
The proof of coNP-hardness is similar {F is unsatisfiable if and only if F is not valid 
and DepLit{F) = 0). 
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• Determining whether DepVar{Yl) = V is BH2-complete. Membership follows easily 
from the membership of the corresponding problem for DepLit{Yl). Hardness is similar 
to the hardness proof for DepLit{ll) = L. 

• Computing DepV ar{Tj) is in FA2 and is both NP-hard and coNP-hard. Similar to the 
corresponding result for DepLit{T,). 

o 

Proposition 14 The set of models of ForgetLit{I], {I}) can be expressed as: 

Mod{ForgetLit{T., {I})) = Mod(S) U {Force(w, -^l) | w |= E} 

= {uj I Force{uj, I) \= S} 

Proof: The proof is obtained immediately from the definition. o 

Proposition 15 The set of models of ForgetLitCE, L) can be expressed as: 

Mod{ForgetLit{T,, L)) = {lj \ Force{LJ,Li) \= S where Li C L} 

Proof: By induction on \L\. The base case in which L is empty is trivial. Let now assume 
that the property holds for any L composed of k elements, and prove that it holds as well 
for L U {/}• 

By definition, 

Lj \= ForgetLit{T., L U {/}) 

if and only if cv \= ForgetLit{ForgetLit{Yl, L), {I})) 

if and only if to |= ForgetLit{T,, L) or Force{uj, I) \= ForgetLit{T,, L) 

if and only if Force{u), L') \= ForgetLit{T,, L) where L' C {/} 

Using the induction hypothesis, we can express the set of models of ForgetLit{T,, L) as 
the models lj' such that Force{Lj\ Li) \= S, where Li is any subset of L. As a result, 

CjJ 1= ForgetLitiT,, L U {/}) 

if and only if Force{u), L') = u' and Force{u)' , Li) \= E, where L' C {/} and Li <Z L 
if and only if Force{uj, L'l) |= S where L'^ C L U {/} 

As a result, the models of ForgetLit{T,, L) are the models that can be mapped into 
models of S by forcing a subset of literals in L to become true. o 

Proposition 16 Lei T, be a formula from PROPps and L C Lpg- ForgetLitCE, L) is 
the logically strongest consequence ofT, that is Lit- independent from L (up to logical equiv- 
alence). 

Proof: By induction on |L|. The base case \L\ = is trivial. Let us now assume that 
the proposition holds for every |L| < A; and show that it remains true when |L| = A; + 1. 
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Let L = L' U {I}. By the induction hypothesis, we can assume that ForgetLit{Yl, L') 
is the most general consequence of E that is Lit-independent from L'. For the sake 
of simplicity, let S' denote this formula. It remains to show that ForgetLit{'E, L) = 
ForgetLit{ForgetLit{T,, L'), {I}) = F or get Lit {T,' , {I}) is the most general consequence of 
S' that is Lit-independent from Two cases are to be considered: 

• I = X. ForgetLit{T,' , x) is Lit-independent from x if and only if 
ForgetLit{T/ ,x) \= ForgetLit{T,' , x)x4^o holds. We have 

(E^^i V i^x A S'))x^o = iK^i V K^o) 

This formula clearly is a logical consequence of ForgetLit{T,' , {x}). Hence ForgetLit{T,' , x) 
is Lit-independent from x. ForgetLit{Yl' ,x) is a logical consequence of S'. Indeed, 
for every E' € PROPps, x € PS, we have T,' = {x A S^^^) V {^x A S^^g). It remains 
to show that every logical consequence $ of E' that is Lit-independent from a; is a 
logical consequence of ForgetLit{Il' ,x). Let $ any formula s.t. S' |= $ holds and 
$ 1= $x<i-o- Since H' = {x A 'S'^^i) V {^x A S^^g) holds, we have E' |= $ if and only 
if both (x A E^^ J 1= $ and {^x A E^^g) 1= ^ hold. Thus, in order to show that 
ForgetLit{T,' ,x) \= $ holds, it is sufficient to prove that E^^,^^ |= $ holds. Suppose 
that is not the case. Then, there exists an implicant 7 of E^^^ that is not an implicant 
of However, since (2; A E^^^) |= $ holds, we know that {x A 7) is an implicant of 
Since $ is equivalent to the conjunction of its prime implicates, there necessarily 
exists a prime implicate tt of $ s.t. (a; A 7) |= tt and 7 ^ vr hold. This imposes that x 
belongs to tt but no literal of 7 belongs to tt. By construction, t^x^^q is an implicate of 
$3;^o and TTai^o is strictly stronger than tt. Since $ |= ^x-i^a holds, $ |= 'Kx-i^q holds 
as well. This contradicts the fact that tt is a prime implicate of 

• / = -ix. The demonstration is similar, mutatis mutandis. 

o 

Proposition 17 Let E, $ be two formulas from PROPps and L C Lpg- 
ForgetLit{T, V L) = ForgetLit{T,, L) V ForgetLit{^, L). 



Proof: The claim can be easily proved from Proposition 15. Indeed, the models of 

ForgetLit{T,\/^, L) are the models oj such that Force{u!, Li) |= EV$, where Li C L. Now, 
Force{LJ,Li) |= E V $ holds if and only if Force{u], Li) \= E holds or Force{uj,Li) \= $ 
holds. 

On the other hand, the models of ForgetLit{Yl, L) V ForgetLit{^, L) are those such that 
Force{uj, Li) |= E or Force{u),Li) \= $ for some Li C L. This is equivalent to the above 
condition. o 
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Proposition 18 Let j be a consistent term from PROPps and L C Lps- 
ForgetLit{-f,L) = Aiejs.t.liL^- 

Proof: We first prove the following lemma: 

Lemma 1 For any consistent term 7, we have ForgetLit{'y, /) = 7 \ {/}. 

Proof (lemma): 

case 1 / G 7, i.e., 7 = / A 7'. 

ForgetLit{j, I) = ji^i V V 7) = 7' = 7 \ {/}. 

case 2 G 7, i.e., 7 = A 7'. 

ForgetLit{'y, /)=_LV7 = 7 = 7\ {/}. 

case 3 I ^ 'J and ^ 7. 

ForgetLit{j, /) = 7 V V 7) = 7 = 7 \ {/}. 



A straightforward induction on L completes the proof o 

Proposition 19 Let Tj be a formula from PROPps and L C Lps. 
IP{ForgetLit{^, L)) = {S \ S e /P(S) and Lit{5) n L = 0}. 

Proof: 

• C: Let 5 e IP{ForgetLit{T,,L)). Since E |= ForgetLit{T,,L) and ForgetLit{T,,L) ^ 
5 hold, J is an implicate of S. Hence, there exists a prime implicate 5' of S s.t. J' |= 5 
holds. Since 8 does not contain any literal from L, this is also the case for 8' . Thus, 
8' is a logical consequence of S that is Lit-independent from L. As a consequence of 
Proposition 16, it must be the case that ForgetLit{T,, L) \= 8'. Hence, there exists a 
prime implicate 8" of ForgetLit{T,, L) s.t. 8" \= 8' holds. This implies that 8" |= 8 
holds as well, and since both clauses are prime implicates of the same formula, they 
are equivalent. But this implies that 8 = 8' holds, which completes the proof. 

• D: Let ^ be a prime impUcate of E that does not contain any literal from L. 8 \s 
Lit-independent from L. Since 8 is an implicate of S, it must also be an implicate of 
ForgetLit{T,,L). Subsequently, there exists a prime implicate 8' oi ForgetLit{T,,L) 
s.t. 8' 8 holds. Since S |= ForgetLit{E, L) and ForgetLit{T.,L) ^ 8' both hold, 
we have E |= 5' as well. Hence, there exists a prime implicate 8" of E s.t. 8" \= 8' 
holds. Therefore, 8" |= 8 holds and since both clauses are prime implicates of the same 
formula, they are equivalent. But this implies that 8 = 8' holds, which completes the 
proof 
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Proposition 20 Let T, be a formula from PROPps and V C PS. We have 

ForgetVar{T.,V) = ForgetLit{E, Ly) 

Proof: By induction on \V\. The proposition trivially holds for \V\ =0. Let us assume it 
is true whenever \V\ = k. Let V = {vi, ...,Vk+i}- By definition, we have 

ForgetVar{T,, V) = ForgetVar{ForgetVar{T,, {vi, u^}), {w^+i}) 

By the induction hypothesis, ForgetVar{T,,V) is equivalent to ForgeiFar($, {u/c+i}), 
where $ is defined as: 

$ = ForgetLit(Jl, {x; x G {vi, . . . , v^}} U {^x; x G {vi, . . . , ija;}}) 

We have ForgetVar{T,,V) = ForgetVar{^, {vt+i})- By definition, 

ForgetVar{^,Vk+i) = $«j,+i^o V ^v,,+i^i- 

We also have 

ForgetLit{^, W+i, -^Vk+i}) 

= ForgetLit{ForgetLit{^, {vk+i}), {^Wfe+i}) 

= ForgetL'it{{^Vk+i A $i,,+i^o) V 

This simplifies to {v^^i A $^^,^^^1) V $^^.^^^0 V which is also equivalent to 

$„j.^j^o V hence equivalent to ForgetVar{^, {vk+i})- Consequently, we have 

ForgetVar{^,V) 

= ForgetLit{ForgetLit{T,, {x;x € {vi, . . . , Vk}} U {^x; x e{vi,..., Vk}}), {vk+i,^Vk+i}) 
= ForgetLit{T,,V). 

o 

Proposition 21 Let S, $ 6e too formulas from PROPps, and V he a subset of PS. If 
V 0-1 S, then ForgetVar{J: A F) = E A ForgetVar{^, V). 

Proof: Let us consider the case where V = {v}. By definition, we have ForgetVar{T, A 
= (S A $)„^o V (S A Equivalents, ForgetVar{T, A = (S„^o A 

$„^o) V (S„4_i A When v 0-1 S, we have S = S„^o = S^^i. Accordingly, 

ForgetVar{T, A {?^}) = S A ($„^o V = S A For5ei($, {-;;}). A straightforward 

induction completes the proof. o 

Proposition 22 Let S, <& be two formulas from PROPps, and P, Q, Z be three disjoint 
sets of variables from PS (such that Var{T,) U Var{<^) C PUQU Z). It holds: 
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1. If ^ does not contain any variable from Z 

circ{j:,{p,q,z)) ^ $ 

if and only if 
E \= ForgetLit{T. A Lp U Lz) 

2. In the general case: 

CIRC{^,{P,Q,Z)) ^ $ 
if and only if 

S 1= ForgetLit{i: A -^ForgetLit{T, A -Lz U L~),Lz U L~) 
where CIRC is circumscription as defined in (McCarthy, 1986). 
Proof: 

1. This is a consequence of Theorem 2.5 from (Przymusinski, 1989). This theorem 
states that if $ does not contain literals from Z, then CIRC{T,, {P,Q, Z)) \= $ 
holds if and only if there is no clause 7 s.t. 7 does not contain any literal from 
Lp U Lz and S |= V 7 but S ^ 7. This is equivalent to state that, if $ does 
not contain literals from Z, then CIRC{T,, {P,Q, Z)) |= $ holds if and only if, for 
every clause 7 containing only literals from Lp U Lq, we have S A $ |= 7 if and 
only if S 1= 7. It is easy to see that the equivalence is preserved would any formula 
7 Lit-independent from Lp U Lz be considered (any formula can be turned into an 
equivalent CNF formula). Thus, Theorem 2.5 can be rephrased in forgetting terms: if 
$ does not contain variables from Z, then CIRC{T,, {P, Q, Z)) \= $ holds if and only 
if S A $ =l+uLq ^ if if ^ 1= ForgetLit{T. A Lp U Lz). 

2. This is a consequence of Theorem 2.6 from (Przymusinski, 1989). This theorem 
states that CIRC{T^,{P,Q,Z)) |= $ holds if and only if S |= $ or there exists a 
formula * s.t. ^ does not contain any literal from Lp U Lz, S |= $ V ^ holds and 
CIRC{Tj, (P, Q, Z)) 1= -1^. It is easy to see that such a formula * exists if and only if 
the conjunction a of all the formulas * such that * does not contain any literal from 
LpULz and S \= holds is s.t. CIRC{T,, (P, Q, Z)) \= -^a. Since S |= holds 
if and only if S A |= * holds, a is equivalent to PorgetL«t(SA-i$, LpUL^). Thus, 
CIRC{J:, (P, Q, Z)) 1= $ holds if and only if E |= $ holds or CIRC{T., (P, Q, Z)) ^ 
-^ForgetLitCE A Lp U L^) holds. In the case where E |= $ holds, S A is 
inconsistent, so it is also the case of ForgetLit{Yl A -i$,Lp U Lz). Thus, if S |= $ 
holds, CIRC{T,, (P, Q, Z)) \= -^ForgetLit{EA^^, L'ULz) holds as well. Accordingly, 
CIRC{J:, (P, Q, Z)) ^ $ holds if and only if CIRC{T., (P, Q, Z)) |= -^ForgetLit{i: A 
-1$, Lp U Lz) holds. Since ForgetLit{T, A Lp U Lz) does not contain any literal 
from Z, the point just above enables concluding the proof 
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Proposition 23 Let T, be a formula from PROPps and let L be a subset of Lps- In the 
general case, there is no propositional formula $ equivalent to ForgetLit{T,, L) s.t. the size 
of ^ is polynomially bounded in |S| + \L\, unless NP n coNP C P/poly. 

Proof: The justification is twofold: 

(1) ForgetLit{T., L) is the logically strongest consequence of S that is Lit-independent 
from L. Consequently, for every formula 7 G PROPps, we have S |= 7 if and only if 
ForgetLit{Y,,L) \= 7, where L = Lit{T,) \ Lit{'y). Because ForgetLit{T,, L) depends only 
on literals of Lit{'E) H Lit{'y), it is an interpolant of S and 7, i.e., a formula ip s.t. T, \= ip 
and 1= 7 hold; thus we have S |= ForgetLit{T,, Lit{T,) \ Lit{'y)) \= 7. 

(2) the existence of a propositional formula, interpolant of E and 7, and of size polynomially 
bounded in |E| + I7I would imply that NP n coNP C P/poly (Boppana & Sipser, 1990), 
which is considered very unlikely in complexity theory. o 

Proposition 24 LiT-EQUIVALENCE and Var-EQUIVALENCE are U^-complete. 



• Var-equivalence. 

— Membership: In order to check the membership to the complementary problem, 
guessing a clause 7 built up from V, and checking that (S |= 7 and $ ^ 7) or 
(S ^ 7 and $ 1= 7) is sufficient. The check step can be easily accomplished 
in polynomial time when an NP oracle is available. Hence, the complementary 
problem belongs to Ti^. 

— Hardness: Let M be the mapping that associates the triple {T,, true, A) to the 
quantified boolean formula yABBT, (where {A,B} is a partition of Far(S)). 
Clearly enough, M is polytime. Moreover, we have: 



Since the validity problem for 2-QBF formulas is H2-complete, this proves the 
H2-hardness of Var-equivalence. 

• Lit-equivalence. 

— Membership: See the membership proof above, replacing "built up from F"by 
"s.t. Liti-y) C L'\ 

— Hardness: Let M be the mapping that associates (S, <I>, V) to (S, Ly)- Clearly 
enough, M({S, F)) can be computed in time polynomial in We 
have shown that S and $ are Var-equivalent given V if and only if S and $ are 



Proof: 



yA3B'S is valid if and only if 
if and only if 
if and only if 



1= ForgetVar{T,,B) 
E =A true 



1= 35 . E 
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Lit-equivalent given Ly- Hence, M is a polynomial many-one reduction from 
Var-equivalence to LiT-EQUiVALENCE. Since Var-equivalence is n^-hard, 
this is also the case for Lit-equivalence. 

o 

Proposition 25 Let be a formula from PROPps and V a subset of PS. S is influ- 
enceable from V if and only if T, is Var-dependent on V. 

Proof: Proposition 4 from (Boutilier, 1994) states that S is influenceable from V if and 
only if there exists a prime implicant of S that contains a variable from V, where V is the 
set of controllable variables. Proposition 9 completes the proof. o 

Proposition 26 Let T, be a formula from PROPps and V a subset of PS. E is relevant 
to V if and only if S is Var-dependent on V. 

Proof: The proof is trivial from Proposition 9. o 

Proposition 27 Let T, be a formula from PROPps and V a subset of PS. S is strictly 
relevant to V if and only ifT, is Var-dependent on V and Var-independent from Var{T,)\V . 

Proof: Easy from the definition of strict relevance, plus Proposition 9 (S is Var-dejiendent 
from every variable occurring in a prime implicate of S and Var-independent from all the 
remaining variables). o 

Proposition 28 

(1) STRICT RELEVANCE OF A FORMULA TO A SUBJECT MATTER (Lakemeyer, 1995) is E^- 
complete. 

(2) STRICT RELEVANCE OF A FORMULA TO A SUBJECT MATTER (Lakemeyer, 1997) is BH2- 
complete. 

Proof: 

• strict relevance (Lakemeyer, 1995) 

— Membership. Let us consider the complementary problem. Guess a clause 7, 
check that it does not contain any variable from V (this can be achieved in time 
polynomial in I7I + \V\, hence in time polynomial in |S| + \V\ since no prime 
implicate of S can include a variable that does not occur in S). Then check that 
it is an implicate of S (one call to an NP oracle) and check that every subclause 
of 7 obtained by removing from it one of its k literals is not an implicate of S 
{k calls to an NP oracle). Since only k + 1 calls to such an oracle are required 
to check that 7 is a prime implicate of S, the complementary problem of strict 
RELEVANCE belongs to Hence, strict relevance belongs to H2. 

— Hardness. Let {A,B} be a partition of Far(S) (for any formula S). yABBT, is 
valid if and only if every prime implicate of S that contains a variable from A 
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also contains a variable from B if and only if every prime implicate of S contains 
a variable from B (since Far(E) = AU B) if and only if S is strictly relevant to 
B. 

• strict relevance (Lakemeyer, 1997) 

— Membership: Straightforward from Propositions 27 and 10. 

— Hardness: By exhibiting a polynomial reduction from SAT-UNSAT to STRICT REL- 
EVANCE OF A FORMULA TO A SUBJECT MATTER. To any pair (0, ip) of propo- 
sitional formulas, let rename{i]j) be a formula obtained from ijj by renaming its 
variables. Obviously, 

(i) rename{ip) is satisfiable if and only if tp is. 
Now, let new be a new variable, let 

Ti = (f) A new A -^r-enarne{ip) 
and V = Var{(j)) U {new}. By Proposition 8, S is Var-dependent on V if and 
only if there is a prime implicant of E mentioning a variable from V, i.e., if and 
only if A -^rename{ip) is satisfiable, thus, using (i): 

(ii) E is Var-dependent on V if and only if both (/) and -it/; are satisfiable. 
Then, again after Proposition 8, E is Var-independent from Var{T,) \V = 
Var{rename{'ip)) if and only if no prime implicant of E mentions a variable 
from V ar {rename {iIj)), i.e., if and only if rename('0) is unsatisfiable, thus, using 

(i): 

(iii) E is Var-independent from Var{T,) \ V if and only if if) is satisfiable. 
Thus, from Proposition 27, (ii) and (iii), we get that E is strictly relevant to V 
if and only if (p is satisfiable and rename{ip) is not. 
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Appendix B: Glossary 

Here is a small glossary of useful terms with the place where their definition can be found. 



PROPv 
Lv 

Lv 

Far(E) 

NNF 

Lit{T,) 

OJv 

nv 

CjJ 

Mod{T.) 
for{S) 

'^1<-1 

Force(co, I) 
/P(S) 
P/(S) 
BH2, C0BH2 

\P yP ttP 



I H- S 

DepLit{T) 

DepL'it{T.) 
Lit- simplified 
Var-simplified 
ForgetLtt{T.,L) 
ForgetVar{T,,L) 



propositional language generated by V 

set of literals built up from V 

set of positive literals built up from V 

set of negative literals built up from V 

set of propositional variables appearing in S 

negation normal form 

set of literals occurring in the NNF of S 

a F-world (instantiations of all variables of V) 

set of all F- worlds 

world (full instanciation) 

set of models of S 

formula such that Moc?(S) = S 



set of prime implicates of E 
set of prime implicants of S 



syntactical Lit-dependence 
syntactical Var-dependence 
(semantical) Lit-dependence 
literals such that I i-> S 
(semantical) Var-dependence 
variables such that / 1-^ S 



literal forgetting 

variable forgetting 

Lit-equivalence 

Var-equi valence 

influenceability 

relevance to a subject matter 

strict relevance to a subject matter 



Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.1 
Section 2.2 
Section 2.2 
Definition 1 
Definition 2 
Definition 3 
Definition 3 
Definition 4 
Definition 4 
Definition 6 
Definition 6 
Definition 7 
Definition 8 
Definition 9 
Definition 9 
Definition 10 
Definition 11 
Definition 12 
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